Limited Performance of Machine Learning Models Developed Based on Demographic and Laboratory Data Obtained Before Primary Treatment to Predict Coronary Aneurysms
<b>Background/objectives</b>: Kawasaki disease is the leading cause of acquired heart disease in children within developed countries. Although treatment with intravenous immunoglobulin (IVIG) significantly reduces the incidence of coronary artery aneurysm (CAA), the risk of it persists,...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-04-01
|
| Series: | Biomedicines |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2227-9059/13/5/1073 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | <b>Background/objectives</b>: Kawasaki disease is the leading cause of acquired heart disease in children within developed countries. Although treatment with intravenous immunoglobulin (IVIG) significantly reduces the incidence of coronary artery aneurysm (CAA), the risk of it persists, affecting long-term patient outcomes. While intensified primary treatment is recommended for patients at high risk of IVIG resistance or CAA development, a universally accepted predictive model for such resistance remains unestablished. This study aims to develop a machine learning model to predict the occurrence of CAAs prior to initiating IVIG therapy. <b>Methods</b>: Data from two nationwide epidemiological surveys conducted between 2012 and 2017 were analyzed, encompassing 17,189 patients with calculable coronary artery z-scores and Harada scores. Various supervised machine learning algorithms were applied to develop a model for predicting CAA. Afterward, unsupervised learning techniques were employed to explore the data’s inherent structure. <b>Results</b>: The Harada score’s receiver operating characteristic (ROC) analysis yielded an area under the curve (AUC) of 0.558. The highest AUC among the machine learning models was 0.661, achieved by the Light Gradient Boosting Machine. However, this model’s sensitivity was 0.615, and specificity was 0.647, indicating limited clinical applicability. Unsupervised learning revealed no distinct distribution patterns between patients with/without CAAs. <b>Conclusions</b>: Despite utilizing a large dataset to develop a machine learning-based prediction model for CAAs, the performance was unsatisfactory. Future studies should focus on enhancing predictive models by incorporating additional clinical data, such as acute-phase coronary artery diameter measurements, to improve accuracy and clinical utility. |
|---|---|
| ISSN: | 2227-9059 |