Transformer-based transfer learning on self-reported voice recordings for Parkinson’s disease diagnosis

Abstract Deep learning (DL) techniques are becoming more popular for diagnosing Parkinson’s disease (PD) because they offer non-invasive and easily accessible tools. By using advanced data analysis, these methods improve early detection and diagnosis, which is crucial for managing the disease effect...

Full description

Saved in:
Bibliographic Details
Main Authors: Ilias Tougui, Mehdi Zakroum, Ouassim Karrakchou, Mounir Ghogho
Format: Article
Language:English
Published: Nature Portfolio 2024-12-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-81824-x
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Deep learning (DL) techniques are becoming more popular for diagnosing Parkinson’s disease (PD) because they offer non-invasive and easily accessible tools. By using advanced data analysis, these methods improve early detection and diagnosis, which is crucial for managing the disease effectively. This study explores end-to-end DL architectures, such as convolutional neural networks and transformers, for diagnosing PD using self-reported voice data collected via smartphones in everyday settings. Transfer learning was applied by starting with models pre-trained on large datasets from the image and the audio domains and then fine-tuning them on the mPower voice data. The Transformer model pre-trained on the voice data performed the best, achieving an average AUC of $$95.89\%$$ and an average AUPRC of $$87.11\%$$ , outperforming models trained from scratch. To the best of our knowledge, this is the first use of a Transformer model for audio data in PD diagnosis, using this dataset. We achieved better results than previous studies, whether they focused solely on the voice or incorporated multiple modalities, by relying only on the voice as a biomarker. These results show that using self-reported voice data with state-of-the-art DL architectures can significantly improve PD prediction and diagnosis, potentially leading to better patient outcomes.
ISSN:2045-2322