Enhanced early detection of dysarthric speech disabilities using stacking ensemble deep learning model
Communication disorders, particularly dysarthria, significantly impact individuals by impairing their speech clarity, social interactions, and overall well-being. Early and accurate detection is crucial to enable timely intervention and improve speech therapy outcomes. This study introduces Adaptive...
Saved in:
| Main Authors: | , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2025-09-01
|
| Series: | Machine Learning with Applications |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2666827025001045 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Communication disorders, particularly dysarthria, significantly impact individuals by impairing their speech clarity, social interactions, and overall well-being. Early and accurate detection is crucial to enable timely intervention and improve speech therapy outcomes. This study introduces Adaptive Dysarthric Speech Disability Detection using Stacked Ensemble Deep Learning (ADSDD-SEDL), an innovative ensemble-based deep-learning framework for dysarthria detection. The proposed model integrates three deep learning architectures—Multi-Head Attention-based Long Short-Term Memory (MHALSTM), Deep Belief Network (DBN), and Time-Delay Neural Network (TDNN)—within a stacked ensemble model. Unlike conventional stacking methods that use fixed meta-classifiers, this study employs a Genetic Algorithm (GA)-based optimization strategy to dynamically determine optimal weight contributions of the base models, enhancing classification robustness and adaptability.The preprocessing pipeline converts speech signals from the time domain to the frequency domain by using a Short-Time Fourier Transform (STFT). Mel-Frequency Cepstral Coefficients (MFCCs) were extracted to capture the key spectral characteristics. Each base model underwent independent training, and the GA optimized the ensemble by evolving an adaptive weight distribution instead of relying on predefined fusion methods. Extensive simulations and hyperparameter tuning confirmed that the GA-optimized ADSDD-SEDL technique significantly improved detection efficiency over traditional ensemble approaches. These findings underscore the advantages of evolutionary optimization in refining speech disorder classification models. This scalable and adaptive model offers a valuable tool for healthcare professionals, enabling precise and automated early diagnosis of dysarthria. Future research could explore alternative evolutionary algorithms, reinforcement learning techniques, and hybrid deep learning approaches to enhance speech disorder classification. |
|---|---|
| ISSN: | 2666-8270 |