Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks

This study presents a novel approach using Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic electroencephalography (EEG) and electrocardiogram (ECG) waveforms. The synthetic EEG data represent concentration and relaxation mental states, while the synt...

Full description

Saved in:
Bibliographic Details
Main Authors: Archana Venugopal, Diego Resende Faria
Format: Article
Language:English
Published: MDPI AG 2024-11-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/23/10818
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850061558511566848
author Archana Venugopal
Diego Resende Faria
author_facet Archana Venugopal
Diego Resende Faria
author_sort Archana Venugopal
collection DOAJ
description This study presents a novel approach using Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic electroencephalography (EEG) and electrocardiogram (ECG) waveforms. The synthetic EEG data represent concentration and relaxation mental states, while the synthetic ECG data correspond to normal and abnormal states. By addressing the challenges of limited biophysical data, including privacy concerns and restricted volunteer availability, our model generates realistic synthetic waveforms learned from real data. Combining real and synthetic datasets improved classification accuracy from 92% to 98.45%, highlighting the benefits of dataset augmentation for machine learning performance. The WGAN-GP model achieved 96.84% classification accuracy for synthetic EEG data representing relaxation states and optimal accuracy for concentration states when classified using a fusion of convolutional neural networks (CNNs). A 50% combination of synthetic and real EEG data yielded the highest accuracy of 98.48%. For EEG signals, the real dataset consisted of 60-s recordings across four channels (TP9, AF7, AF8, and TP10) from four individuals, providing approximately 15,000 data points per subject per state. For ECG signals, the dataset contained 1200 real samples, each comprising 140 data points, representing normal and abnormal states. WGAN-GP outperformed a basic generative adversarial network (GAN) in generating reliable synthetic data. For ECG data, a support vector machine (SVM) classifier achieved an accuracy of 98% with real data and 95.8% with synthetic data. Synthetic ECG data improved the random forest (RF) classifier’s accuracy from 97% with real data alone to 98.40% when combined with synthetic data. Statistical significance was assessed using the Wilcoxon signed-rank test, demonstrating the robustness of the WGAN-GP model. Techniques such as discrete wavelet transform, downsampling, and upsampling were employed to enhance data quality. This method shows significant potential in addressing biophysical data scarcity and advancing applications in assistive technologies, human-robot interaction, and mental health monitoring, among other medical applications.
format Article
id doaj-art-607690f56dd040fc92de548109eea1aa
institution DOAJ
issn 2076-3417
language English
publishDate 2024-11-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-607690f56dd040fc92de548109eea1aa2025-08-20T02:50:12ZengMDPI AGApplied Sciences2076-34172024-11-0114231081810.3390/app142310818Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial NetworksArchana Venugopal0Diego Resende Faria1School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, Hertfordshire, UKSchool of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, Hertfordshire, UKThis study presents a novel approach using Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic electroencephalography (EEG) and electrocardiogram (ECG) waveforms. The synthetic EEG data represent concentration and relaxation mental states, while the synthetic ECG data correspond to normal and abnormal states. By addressing the challenges of limited biophysical data, including privacy concerns and restricted volunteer availability, our model generates realistic synthetic waveforms learned from real data. Combining real and synthetic datasets improved classification accuracy from 92% to 98.45%, highlighting the benefits of dataset augmentation for machine learning performance. The WGAN-GP model achieved 96.84% classification accuracy for synthetic EEG data representing relaxation states and optimal accuracy for concentration states when classified using a fusion of convolutional neural networks (CNNs). A 50% combination of synthetic and real EEG data yielded the highest accuracy of 98.48%. For EEG signals, the real dataset consisted of 60-s recordings across four channels (TP9, AF7, AF8, and TP10) from four individuals, providing approximately 15,000 data points per subject per state. For ECG signals, the dataset contained 1200 real samples, each comprising 140 data points, representing normal and abnormal states. WGAN-GP outperformed a basic generative adversarial network (GAN) in generating reliable synthetic data. For ECG data, a support vector machine (SVM) classifier achieved an accuracy of 98% with real data and 95.8% with synthetic data. Synthetic ECG data improved the random forest (RF) classifier’s accuracy from 97% with real data alone to 98.40% when combined with synthetic data. Statistical significance was assessed using the Wilcoxon signed-rank test, demonstrating the robustness of the WGAN-GP model. Techniques such as discrete wavelet transform, downsampling, and upsampling were employed to enhance data quality. This method shows significant potential in addressing biophysical data scarcity and advancing applications in assistive technologies, human-robot interaction, and mental health monitoring, among other medical applications.https://www.mdpi.com/2076-3417/14/23/10818synthetic data generationGANWGAN-GPEEGECG
spellingShingle Archana Venugopal
Diego Resende Faria
Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
Applied Sciences
synthetic data generation
GAN
WGAN-GP
EEG
ECG
title Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_full Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_fullStr Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_full_unstemmed Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_short Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_sort boosting eeg and ecg classification with synthetic biophysical data generated via generative adversarial networks
topic synthetic data generation
GAN
WGAN-GP
EEG
ECG
url https://www.mdpi.com/2076-3417/14/23/10818
work_keys_str_mv AT archanavenugopal boostingeegandecgclassificationwithsyntheticbiophysicaldatageneratedviagenerativeadversarialnetworks
AT diegoresendefaria boostingeegandecgclassificationwithsyntheticbiophysicaldatageneratedviagenerativeadversarialnetworks