A new method for GAN-based data augmentation for classes with distinct clusters


Kuntalp M., Düzyel O.

Expert Systems with Applications, cilt.235, 2024 (SCI-Expanded) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 235
  • Basım Tarihi: 2024
  • Doi Numarası: 10.1016/j.eswa.2023.121199
  • Dergi Adı: Expert Systems with Applications
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus
  • Anahtar Kelimeler: Data augmentation, ECG, Generative adversarial neural networks, t-SNE
  • Dokuz Eylül Üniversitesi Adresli: Evet

Özet

Data augmentation is a commonly used approach for addressing the issue of limited data availability in machine learning. There are various methods available, including classical and modern techniques. However, when applying modern data augmentation methods, such as Generative Adversarial Neural Networks (GANs), to a class specific data, the resulting data can exhibit structural discrepancies. This study explores a different use of GANs as a data augmentation method that solves this problem using the electrocardiogram (ECG) signals in the MIT-BIH arrhythmia dataset as the example. We begin by examining the cluster structure of a specific class using t-Distributed Stochastic Neighbor (t-SNE) method. Based on this cluster structure, we propose a new method for applying GANs to augment data for that class. We assess the effect of our method in a classification task using 1-D Convolutional Neural Network (CNN), Support Vector Machine (SVM), One vs one classifier (Ovo), K-Nearest Neighbors (KNN), and Random Forest as the classifiers. The results demonstrate that our proposed method could lead to better classification performance if a specific class has distinct clusters when compared to normal use of GANs.