Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks

This study presents a novel approach using Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic electroencephalography (EEG) and electrocardiogram (ECG) waveforms. The synthetic EEG data represent concentration and relaxation mental states, while the synt...

Full description

Saved in:

Bibliographic Details
Main Authors:	Archana Venugopal, Diego Resende Faria
Format:	Article
Language:	English
Published:	MDPI AG 2024-11-01
Series:	Applied Sciences
Subjects:	synthetic data generation GAN WGAN-GP EEG ECG
Online Access:	https://www.mdpi.com/2076-3417/14/23/10818
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850061558511566848
author	Archana Venugopal Diego Resende Faria
author_facet	Archana Venugopal Diego Resende Faria
author_sort	Archana Venugopal
collection	DOAJ
description	This study presents a novel approach using Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic electroencephalography (EEG) and electrocardiogram (ECG) waveforms. The synthetic EEG data represent concentration and relaxation mental states, while the synthetic ECG data correspond to normal and abnormal states. By addressing the challenges of limited biophysical data, including privacy concerns and restricted volunteer availability, our model generates realistic synthetic waveforms learned from real data. Combining real and synthetic datasets improved classification accuracy from 92% to 98.45%, highlighting the benefits of dataset augmentation for machine learning performance. The WGAN-GP model achieved 96.84% classification accuracy for synthetic EEG data representing relaxation states and optimal accuracy for concentration states when classified using a fusion of convolutional neural networks (CNNs). A 50% combination of synthetic and real EEG data yielded the highest accuracy of 98.48%. For EEG signals, the real dataset consisted of 60-s recordings across four channels (TP9, AF7, AF8, and TP10) from four individuals, providing approximately 15,000 data points per subject per state. For ECG signals, the dataset contained 1200 real samples, each comprising 140 data points, representing normal and abnormal states. WGAN-GP outperformed a basic generative adversarial network (GAN) in generating reliable synthetic data. For ECG data, a support vector machine (SVM) classifier achieved an accuracy of 98% with real data and 95.8% with synthetic data. Synthetic ECG data improved the random forest (RF) classifier’s accuracy from 97% with real data alone to 98.40% when combined with synthetic data. Statistical significance was assessed using the Wilcoxon signed-rank test, demonstrating the robustness of the WGAN-GP model. Techniques such as discrete wavelet transform, downsampling, and upsampling were employed to enhance data quality. This method shows significant potential in addressing biophysical data scarcity and advancing applications in assistive technologies, human-robot interaction, and mental health monitoring, among other medical applications.
format	Article
id	doaj-art-607690f56dd040fc92de548109eea1aa
institution	DOAJ
issn	2076-3417
language	English
publishDate	2024-11-01
publisher	MDPI AG
record_format	Article
series	Applied Sciences
spelling	doaj-art-607690f56dd040fc92de548109eea1aa2025-08-20T02:50:12ZengMDPI AGApplied Sciences2076-34172024-11-0114231081810.3390/app142310818Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial NetworksArchana Venugopal0Diego Resende Faria1School of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, Hertfordshire, UKSchool of Physics, Engineering and Computer Science, University of Hertfordshire, Hatfield AL10 9AB, Hertfordshire, UKThis study presents a novel approach using Wasserstein Generative Adversarial Networks with Gradient Penalty (WGAN-GP) to generate synthetic electroencephalography (EEG) and electrocardiogram (ECG) waveforms. The synthetic EEG data represent concentration and relaxation mental states, while the synthetic ECG data correspond to normal and abnormal states. By addressing the challenges of limited biophysical data, including privacy concerns and restricted volunteer availability, our model generates realistic synthetic waveforms learned from real data. Combining real and synthetic datasets improved classification accuracy from 92% to 98.45%, highlighting the benefits of dataset augmentation for machine learning performance. The WGAN-GP model achieved 96.84% classification accuracy for synthetic EEG data representing relaxation states and optimal accuracy for concentration states when classified using a fusion of convolutional neural networks (CNNs). A 50% combination of synthetic and real EEG data yielded the highest accuracy of 98.48%. For EEG signals, the real dataset consisted of 60-s recordings across four channels (TP9, AF7, AF8, and TP10) from four individuals, providing approximately 15,000 data points per subject per state. For ECG signals, the dataset contained 1200 real samples, each comprising 140 data points, representing normal and abnormal states. WGAN-GP outperformed a basic generative adversarial network (GAN) in generating reliable synthetic data. For ECG data, a support vector machine (SVM) classifier achieved an accuracy of 98% with real data and 95.8% with synthetic data. Synthetic ECG data improved the random forest (RF) classifier’s accuracy from 97% with real data alone to 98.40% when combined with synthetic data. Statistical significance was assessed using the Wilcoxon signed-rank test, demonstrating the robustness of the WGAN-GP model. Techniques such as discrete wavelet transform, downsampling, and upsampling were employed to enhance data quality. This method shows significant potential in addressing biophysical data scarcity and advancing applications in assistive technologies, human-robot interaction, and mental health monitoring, among other medical applications.https://www.mdpi.com/2076-3417/14/23/10818synthetic data generationGANWGAN-GPEEGECG
spellingShingle	Archana Venugopal Diego Resende Faria Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks Applied Sciences synthetic data generation GAN WGAN-GP EEG ECG
title	Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_full	Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_fullStr	Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_full_unstemmed	Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_short	Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks
title_sort	boosting eeg and ecg classification with synthetic biophysical data generated via generative adversarial networks
topic	synthetic data generation GAN WGAN-GP EEG ECG
url	https://www.mdpi.com/2076-3417/14/23/10818
work_keys_str_mv	AT archanavenugopal boostingeegandecgclassificationwithsyntheticbiophysicaldatageneratedviagenerativeadversarialnetworks AT diegoresendefaria boostingeegandecgclassificationwithsyntheticbiophysicaldatageneratedviagenerativeadversarialnetworks

Boosting EEG and ECG Classification with Synthetic Biophysical Data Generated via Generative Adversarial Networks

Similar Items