Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based Analysis

This study aims to identify personal, clinical, and acoustic predictors of therapy outcomes based on changes in Korean voice-related quality of life (K-VRQOL) scores, as well as to compare the predictive performance of traditional regression and machine learning models. A total of 102 participants u...

Full description

Saved in:
Bibliographic Details
Main Authors: Ji Hye Park, Ah Ra Jung, Ji-Na Lee, Ji-Yeoun Lee
Format: Article
Language:English
Published: MDPI AG 2025-06-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/15/13/7045
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849319901941989376
author Ji Hye Park
Ah Ra Jung
Ji-Na Lee
Ji-Yeoun Lee
author_facet Ji Hye Park
Ah Ra Jung
Ji-Na Lee
Ji-Yeoun Lee
author_sort Ji Hye Park
collection DOAJ
description This study aims to identify personal, clinical, and acoustic predictors of therapy outcomes based on changes in Korean voice-related quality of life (K-VRQOL) scores, as well as to compare the predictive performance of traditional regression and machine learning models. A total of 102 participants undergoing voice therapy are retrospectively analyzed. Multiple regression analysis and four machine learning algorithms—random forest (RF), gradient boosting (GB), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost)—are applied to predict changes in K-VRQOL scores across the total, physical, and emotional domains. The Shapley additive explanations (SHAP) approach is used to evaluate the relative contribution of each variable to the prediction outcomes. Female gender and comorbidity status emerge as significant predictors in both the total and physical domains. Among the acoustic features, jitter, SFF, and MPT are closely associated with improvements in physical voice function. LightGBM demonstrates the best overall performance, particularly in the total domain (R<sup>2</sup> = 32.54%), while GB excels in the physical domain. The emotional domain shows relatively low predictive power across the models. SHAP analysis reveals interpretable patterns, highlighting jitter and speaking fundamental frequency (SFF) as key contributors in high-performing models. Integrating statistical and machine learning approaches provides a robust framework for predicting and interpreting voice therapy outcomes. These findings support the use of explainable artificial intelligence (AI) to enhance clinical decision-making and pave the way for personalized voice rehabilitation strategies.
format Article
id doaj-art-e027a0ee72a3416faa523dd328ecb16a
institution Kabale University
issn 2076-3417
language English
publishDate 2025-06-01
publisher MDPI AG
record_format Article
series Applied Sciences
spelling doaj-art-e027a0ee72a3416faa523dd328ecb16a2025-08-20T03:50:17ZengMDPI AGApplied Sciences2076-34172025-06-011513704510.3390/app15137045Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based AnalysisJi Hye Park0Ah Ra Jung1Ji-Na Lee2Ji-Yeoun Lee3Department of Bigdata Medical Convergence, Eulji University, 553 Sanseong-daero, Sujeong-gu, Seongnam-si 13135, Republic of KoreaDepartment of Otorhinolaryngology, Nowon Eulji Medical Center, Eulji University School of Medicine, 68 Hangeulbiseok-ro, Nowon-gu, Seoul 01830, Republic of KoreaDivision of Global Business Languages, Seokyeong University, Seogyeong-ro, Seongbuk-gu, Seoul 02173, Republic of KoreaDepartment of Bigdata Medical Convergence, Eulji University, 553 Sanseong-daero, Sujeong-gu, Seongnam-si 13135, Republic of KoreaThis study aims to identify personal, clinical, and acoustic predictors of therapy outcomes based on changes in Korean voice-related quality of life (K-VRQOL) scores, as well as to compare the predictive performance of traditional regression and machine learning models. A total of 102 participants undergoing voice therapy are retrospectively analyzed. Multiple regression analysis and four machine learning algorithms—random forest (RF), gradient boosting (GB), light gradient boosting machine (LightGBM), and extreme gradient boosting (XGBoost)—are applied to predict changes in K-VRQOL scores across the total, physical, and emotional domains. The Shapley additive explanations (SHAP) approach is used to evaluate the relative contribution of each variable to the prediction outcomes. Female gender and comorbidity status emerge as significant predictors in both the total and physical domains. Among the acoustic features, jitter, SFF, and MPT are closely associated with improvements in physical voice function. LightGBM demonstrates the best overall performance, particularly in the total domain (R<sup>2</sup> = 32.54%), while GB excels in the physical domain. The emotional domain shows relatively low predictive power across the models. SHAP analysis reveals interpretable patterns, highlighting jitter and speaking fundamental frequency (SFF) as key contributors in high-performing models. Integrating statistical and machine learning approaches provides a robust framework for predicting and interpreting voice therapy outcomes. These findings support the use of explainable artificial intelligence (AI) to enhance clinical decision-making and pave the way for personalized voice rehabilitation strategies.https://www.mdpi.com/2076-3417/15/13/7045Korean voice-related quality of lifeShapley additive explanationsvoice therapyvoice disordersmachine learningregression analysis
spellingShingle Ji Hye Park
Ah Ra Jung
Ji-Na Lee
Ji-Yeoun Lee
Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based Analysis
Applied Sciences
Korean voice-related quality of life
Shapley additive explanations
voice therapy
voice disorders
machine learning
regression analysis
title Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based Analysis
title_full Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based Analysis
title_fullStr Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based Analysis
title_full_unstemmed Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based Analysis
title_short Prediction of Voice Therapy Outcomes Using Machine Learning Approaches and SHAP Analysis: A K-VRQOL-Based Analysis
title_sort prediction of voice therapy outcomes using machine learning approaches and shap analysis a k vrqol based analysis
topic Korean voice-related quality of life
Shapley additive explanations
voice therapy
voice disorders
machine learning
regression analysis
url https://www.mdpi.com/2076-3417/15/13/7045
work_keys_str_mv AT jihyepark predictionofvoicetherapyoutcomesusingmachinelearningapproachesandshapanalysisakvrqolbasedanalysis
AT ahrajung predictionofvoicetherapyoutcomesusingmachinelearningapproachesandshapanalysisakvrqolbasedanalysis
AT jinalee predictionofvoicetherapyoutcomesusingmachinelearningapproachesandshapanalysisakvrqolbasedanalysis
AT jiyeounlee predictionofvoicetherapyoutcomesusingmachinelearningapproachesandshapanalysisakvrqolbasedanalysis