A new method for GAN-based data augmentation for classes with distinct clusters

Kuntalp, MEHMET; Düzyel, Okan

doi:10.1016/j.eswa.2023.121199

A new method for GAN-based data augmentation for classes with distinct clusters

Atıf İçin Kopyala

Kuntalp M., Düzyel O.

Expert Systems with Applications, cilt.235, 2024 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 235
Basım Tarihi: 2024
Doi Numarası: 10.1016/j.eswa.2023.121199
Dergi Adı: Expert Systems with Applications
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
Anahtar Kelimeler: Data augmentation, ECG, Generative adversarial neural networks, t-SNE
Dokuz Eylül Üniversitesi Adresli: Evet

Özet

Data augmentation is a commonly used approach for addressing the issue of limited data availability in machine learning. There are various methods available, including classical and modern techniques. However, when applying modern data augmentation methods, such as Generative Adversarial Neural Networks (GANs), to a class specific data, the resulting data can exhibit structural discrepancies. This study explores a different use of GANs as a data augmentation method that solves this problem using the electrocardiogram (ECG) signals in the MIT-BIH arrhythmia dataset as the example. We begin by examining the cluster structure of a specific class using t-Distributed Stochastic Neighbor (t-SNE) method. Based on this cluster structure, we propose a new method for applying GANs to augment data for that class. We assess the effect of our method in a classification task using 1-D Convolutional Neural Network (CNN), Support Vector Machine (SVM), One vs one classifier (Ovo), K-Nearest Neighbors (KNN), and Random Forest as the classifiers. The results demonstrate that our proposed method could lead to better classification performance if a specific class has distinct clusters when compared to normal use of GANs.