Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approach

Introduction The epidemiology and clinical characteristics of COVID-19 evolved due to new SARS-CoV-2 variants of concern (VOCs). The Omicron VOC’s higher transmissibility increased paediatric COVID-19 cases and hospital admissions. Most research during the Omicron period has focused on hospitalised...

Full description

Saved in:
Bibliographic Details
Main Authors: Carlo Giaquinto, Daniela Paolotti, Daniele Donà, Stefania Fiandrino, Piero Poletti, Michael Davis Tira, Costanza Di Chiara
Format: Article
Language:English
Published: BMJ Publishing Group 2025-06-01
Series:BMJ Public Health
Online Access:https://bmjpublichealth.bmj.com/content/3/1/e001888.full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849715014346211328
author Carlo Giaquinto
Daniela Paolotti
Daniele Donà
Stefania Fiandrino
Piero Poletti
Michael Davis Tira
Costanza Di Chiara
author_facet Carlo Giaquinto
Daniela Paolotti
Daniele Donà
Stefania Fiandrino
Piero Poletti
Michael Davis Tira
Costanza Di Chiara
author_sort Carlo Giaquinto
collection DOAJ
description Introduction The epidemiology and clinical characteristics of COVID-19 evolved due to new SARS-CoV-2 variants of concern (VOCs). The Omicron VOC’s higher transmissibility increased paediatric COVID-19 cases and hospital admissions. Most research during the Omicron period has focused on hospitalised cases, leaving a gap in understanding the disease’s evolution in community settings. This study targets children with mild to moderate COVID-19 during pre-Omicron and Omicron periods. It aims to identify patterns in COVID-19 morbidity by clustering individuals based on symptom similarities and duration of symptoms and develop a machine-learning tool to classify new cases into risk groups.Methods We propose a data-driven approach to explore changes in COVID-19 characteristics by analysing data from 581 children and adolescents collected within a paediatric cohort at the University Hospital of Padua. First, we apply an unsupervised machine-learning algorithm to cluster individuals into groups. Second, we classify new patient risk groups using a random forest classifier model based on sociodemographic information, pre-existing medical conditions, vaccination status and the VOC as predictive variables. Third, we explore the key features influencing the classification through the SHapley Additive exPlanations.Results The unsupervised clustering identified three severity risk profile groups. Cluster 0 (mildest) had an average of 1.2 symptoms (95% CI 0.0 to 5.0) and mean symptom duration of 1.26 days (95%CI 0.0 to 9.0), cluster 1 had 2.27 symptoms (95% CI 1.0 to 6.0) lasting 3.47 days (95% CI 1.0 to 12.0), while cluster 2 (strongest symptom expression) exhibited 3.41 symptoms (95% CI 2.0 to 7.0) over 5.52 days (95% CI 0.0 to 16.0). Feature importance analysis showed that age was the most important predictor, followed by the variant of infection, influenza vaccination and the presence of comorbidities. The analysis revealed that younger children, unvaccinated individuals, those infected with Omicron and those with comorbidities were at higher risk of experiencing a greater number and longer duration of symptoms.Conclusions Our classification model has the potential to provide clinicians with insights into the children’s risk profile of COVID-19 using readily available data. This approach can support public health by clarifying disease burden and improving patient care strategies. Furthermore, it underscores the importance of integrating risk classification models to monitor and manage infectious diseases.
format Article
id doaj-art-8d63c722d0d4493bb42d36dd0863a40d
institution DOAJ
issn 2753-4294
language English
publishDate 2025-06-01
publisher BMJ Publishing Group
record_format Article
series BMJ Public Health
spelling doaj-art-8d63c722d0d4493bb42d36dd0863a40d2025-08-20T03:13:32ZengBMJ Publishing GroupBMJ Public Health2753-42942025-06-013110.1136/bmjph-2024-001888Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approachCarlo Giaquinto0Daniela Paolotti1Daniele Donà2Stefania Fiandrino3Piero Poletti4Michael Davis Tira5Costanza Di Chiara6Department of Women’s and Children’s Health, Università degli Studi di Padova, Padua, ItalyISI Foundation, Torino, ItalyDepartment of Women’s and Children’s Health, Università degli Studi di Padova, Padua, ItalyUniversity of Rome La Sapienza, Rome, ItalyFondazione Bruno Kessler, Trento, ItalyPenta Foundation, Padua, ItalyDepartment of Women’s and Children’s Health, Università degli Studi di Padova, Padua, ItalyIntroduction The epidemiology and clinical characteristics of COVID-19 evolved due to new SARS-CoV-2 variants of concern (VOCs). The Omicron VOC’s higher transmissibility increased paediatric COVID-19 cases and hospital admissions. Most research during the Omicron period has focused on hospitalised cases, leaving a gap in understanding the disease’s evolution in community settings. This study targets children with mild to moderate COVID-19 during pre-Omicron and Omicron periods. It aims to identify patterns in COVID-19 morbidity by clustering individuals based on symptom similarities and duration of symptoms and develop a machine-learning tool to classify new cases into risk groups.Methods We propose a data-driven approach to explore changes in COVID-19 characteristics by analysing data from 581 children and adolescents collected within a paediatric cohort at the University Hospital of Padua. First, we apply an unsupervised machine-learning algorithm to cluster individuals into groups. Second, we classify new patient risk groups using a random forest classifier model based on sociodemographic information, pre-existing medical conditions, vaccination status and the VOC as predictive variables. Third, we explore the key features influencing the classification through the SHapley Additive exPlanations.Results The unsupervised clustering identified three severity risk profile groups. Cluster 0 (mildest) had an average of 1.2 symptoms (95% CI 0.0 to 5.0) and mean symptom duration of 1.26 days (95%CI 0.0 to 9.0), cluster 1 had 2.27 symptoms (95% CI 1.0 to 6.0) lasting 3.47 days (95% CI 1.0 to 12.0), while cluster 2 (strongest symptom expression) exhibited 3.41 symptoms (95% CI 2.0 to 7.0) over 5.52 days (95% CI 0.0 to 16.0). Feature importance analysis showed that age was the most important predictor, followed by the variant of infection, influenza vaccination and the presence of comorbidities. The analysis revealed that younger children, unvaccinated individuals, those infected with Omicron and those with comorbidities were at higher risk of experiencing a greater number and longer duration of symptoms.Conclusions Our classification model has the potential to provide clinicians with insights into the children’s risk profile of COVID-19 using readily available data. This approach can support public health by clarifying disease burden and improving patient care strategies. Furthermore, it underscores the importance of integrating risk classification models to monitor and manage infectious diseases.https://bmjpublichealth.bmj.com/content/3/1/e001888.full
spellingShingle Carlo Giaquinto
Daniela Paolotti
Daniele Donà
Stefania Fiandrino
Piero Poletti
Michael Davis Tira
Costanza Di Chiara
Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approach
BMJ Public Health
title Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approach
title_full Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approach
title_fullStr Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approach
title_full_unstemmed Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approach
title_short Clinical characteristics of COVID-19 in children and adolescents: insights from an Italian paediatric cohort using a machine-learning approach
title_sort clinical characteristics of covid 19 in children and adolescents insights from an italian paediatric cohort using a machine learning approach
url https://bmjpublichealth.bmj.com/content/3/1/e001888.full
work_keys_str_mv AT carlogiaquinto clinicalcharacteristicsofcovid19inchildrenandadolescentsinsightsfromanitalianpaediatriccohortusingamachinelearningapproach
AT danielapaolotti clinicalcharacteristicsofcovid19inchildrenandadolescentsinsightsfromanitalianpaediatriccohortusingamachinelearningapproach
AT danieledona clinicalcharacteristicsofcovid19inchildrenandadolescentsinsightsfromanitalianpaediatriccohortusingamachinelearningapproach
AT stefaniafiandrino clinicalcharacteristicsofcovid19inchildrenandadolescentsinsightsfromanitalianpaediatriccohortusingamachinelearningapproach
AT pieropoletti clinicalcharacteristicsofcovid19inchildrenandadolescentsinsightsfromanitalianpaediatriccohortusingamachinelearningapproach
AT michaeldavistira clinicalcharacteristicsofcovid19inchildrenandadolescentsinsightsfromanitalianpaediatriccohortusingamachinelearningapproach
AT costanzadichiara clinicalcharacteristicsofcovid19inchildrenandadolescentsinsightsfromanitalianpaediatriccohortusingamachinelearningapproach