A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.

Recent advancements in hardware technology have spurred a surge in the popularity and ubiquity of wearable sensors, opening up new applications within the medical domain. This proliferation has resulted in a notable increase in the availability of Time Series (TS) data characterizing behavioral or p...

Full description

Saved in:
Bibliographic Details
Main Authors: Md Abid Hasan, Frédéric Li, Philip Gouverneur, Artur Piet, Marcin Grzegorzek
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0315343
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850128569499385856
author Md Abid Hasan
Frédéric Li
Philip Gouverneur
Artur Piet
Marcin Grzegorzek
author_facet Md Abid Hasan
Frédéric Li
Philip Gouverneur
Artur Piet
Marcin Grzegorzek
author_sort Md Abid Hasan
collection DOAJ
description Recent advancements in hardware technology have spurred a surge in the popularity and ubiquity of wearable sensors, opening up new applications within the medical domain. This proliferation has resulted in a notable increase in the availability of Time Series (TS) data characterizing behavioral or physiological information from the patient, leading to initiatives toward leveraging machine learning and data analysis techniques. Nonetheless, the complexity and time required for collecting data remain significant hurdles, limiting dataset sizes and hindering the effectiveness of machine learning. Data Augmentation (DA) stands out as a prime solution, facilitating the generation of synthetic data to address challenges associated with acquiring medical data. DA has shown to consistently improve performances when images are involved. As a result, investigations have been carried out to check DA for TS, in particular for TS classification. However, the current state of DA in TS classification faces challenges, including methodological taxonomies restricted to the univariate case, insufficient direction to select suitable DA methods and a lack of conclusive evidence regarding the amount of synthetic data required to attain optimal outcomes. This paper conducts a comprehensive survey and experiments on DA techniques for TS and their application to TS classification. We propose an updated taxonomy spanning across three families of Time Series Data Augmentation (TSDA): Random Transformation (RT), Pattern Mixing (PM), and Generative Models (GM). Additionally, we empirically evaluate 12 TSDA methods across diverse datasets used in medical-related applications, including OPPORTUNITY and HAR for Human Activity Recognition, DEAP for emotion recognition, BioVid Heat Pain Database (BVDB), and PainMonit Database (PMDB) for pain recognition. Through comprehensive experimental analysis, we identify the most optimal DA techniques and provide recommendations for researchers regarding the generation of synthetic data to maximize outcomes from DA methods. Our findings show that despite their simplicity, DA methods of the RT family are the most consistent in increasing performances compared to not using any augmentation.
format Article
id doaj-art-5c82a7b4e6d34db6b547c632c59ac54e
institution OA Journals
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-5c82a7b4e6d34db6b547c632c59ac54e2025-08-20T02:33:15ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01203e031534310.1371/journal.pone.0315343A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.Md Abid HasanFrédéric LiPhilip GouverneurArtur PietMarcin GrzegorzekRecent advancements in hardware technology have spurred a surge in the popularity and ubiquity of wearable sensors, opening up new applications within the medical domain. This proliferation has resulted in a notable increase in the availability of Time Series (TS) data characterizing behavioral or physiological information from the patient, leading to initiatives toward leveraging machine learning and data analysis techniques. Nonetheless, the complexity and time required for collecting data remain significant hurdles, limiting dataset sizes and hindering the effectiveness of machine learning. Data Augmentation (DA) stands out as a prime solution, facilitating the generation of synthetic data to address challenges associated with acquiring medical data. DA has shown to consistently improve performances when images are involved. As a result, investigations have been carried out to check DA for TS, in particular for TS classification. However, the current state of DA in TS classification faces challenges, including methodological taxonomies restricted to the univariate case, insufficient direction to select suitable DA methods and a lack of conclusive evidence regarding the amount of synthetic data required to attain optimal outcomes. This paper conducts a comprehensive survey and experiments on DA techniques for TS and their application to TS classification. We propose an updated taxonomy spanning across three families of Time Series Data Augmentation (TSDA): Random Transformation (RT), Pattern Mixing (PM), and Generative Models (GM). Additionally, we empirically evaluate 12 TSDA methods across diverse datasets used in medical-related applications, including OPPORTUNITY and HAR for Human Activity Recognition, DEAP for emotion recognition, BioVid Heat Pain Database (BVDB), and PainMonit Database (PMDB) for pain recognition. Through comprehensive experimental analysis, we identify the most optimal DA techniques and provide recommendations for researchers regarding the generation of synthetic data to maximize outcomes from DA methods. Our findings show that despite their simplicity, DA methods of the RT family are the most consistent in increasing performances compared to not using any augmentation.https://doi.org/10.1371/journal.pone.0315343
spellingShingle Md Abid Hasan
Frédéric Li
Philip Gouverneur
Artur Piet
Marcin Grzegorzek
A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.
PLoS ONE
title A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.
title_full A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.
title_fullStr A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.
title_full_unstemmed A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.
title_short A comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing.
title_sort comprehensive survey and comparative analysis of time series data augmentation in medical wearable computing
url https://doi.org/10.1371/journal.pone.0315343
work_keys_str_mv AT mdabidhasan acomprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT fredericli acomprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT philipgouverneur acomprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT arturpiet acomprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT marcingrzegorzek acomprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT mdabidhasan comprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT fredericli comprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT philipgouverneur comprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT arturpiet comprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing
AT marcingrzegorzek comprehensivesurveyandcomparativeanalysisoftimeseriesdataaugmentationinmedicalwearablecomputing