Predicting COVID-19 severity in pediatric patients using machine learning: a comparative analysis of algorithms and ensemble methods

Abstract COVID-19 has posed a significant global health challenge, affecting individuals across all age groups. While extensive research has focused on adults, pediatric patients exhibit distinct clinical characteristics that necessitate specialized predictive models for disease severity. Machine le...

Full description

Saved in:
Bibliographic Details
Main Authors: Babak Pourakbari, Setareh Mamishi, Sepideh Keshavarz Valian, Shima Mahmoudi, Reihaneh Hosseinpour Sadeghi, Mohammad Reza Abdolsalehi, Mahmoud Khodabandeh, Mohammad Farahmand
Format: Article
Language:English
Published: Nature Portfolio 2025-08-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-15366-1
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract COVID-19 has posed a significant global health challenge, affecting individuals across all age groups. While extensive research has focused on adults, pediatric patients exhibit distinct clinical characteristics that necessitate specialized predictive models for disease severity. Machine learning offers a powerful approach to analyzing complex datasets and predicting outcomes, yet its application in pediatric COVID-19 remains limited. This study evaluates the performance of machine learning algorithms in predicting disease severity among pediatrics. A retrospective analysis was conducted on a dataset of 588 pediatric with confirmed COVID-19, incorporating demographic, clinical, and laboratory variables. Various machine learning models were trained and assessed, with a SuperLearner ensemble model implemented to enhance predictive accuracy. Among the models, Random Forest exhibited the highest performance, achieving an accuracy of 90.1%, sensitivity of 90.2%, and specificity of 90.1%. The SuperLearner ensemble further improved predictive performance, demonstrating the lowest mean risk estimate. Key predictors, including oxygen saturation, respiratory parameters, and specific laboratory markers, played a crucial role in distinguishing severe from non-severe cases. These findings emphasize the potential of machine learning, particularly ensemble methods, in improving risk stratification for pediatric COVID-19. Integrating these predictive models into clinical practice could support early identification of high-risk patients and optimize clinical decision-making.
ISSN:2045-2322