Early childhood caries risk prediction using machine learning approaches in Bangladesh

Abstract Background In the last years, artificial intelligence (AI) has contributed to improving healthcare including dentistry. The objective of this study was to develop a machine learning (ML) model for early childhood caries (ECC) prediction by identifying crucial health behaviours within mother...

Full description

Saved in:
Bibliographic Details
Main Authors: Fardous Hasan, Maha El Tantawi, Farzana Haque, Moréniké Oluwátóyìn Foláyan, Jorma I. Virtanen
Format: Article
Language:English
Published: BMC 2025-01-01
Series:BMC Oral Health
Subjects:
Online Access:https://doi.org/10.1186/s12903-025-05419-2
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544231998455808
author Fardous Hasan
Maha El Tantawi
Farzana Haque
Moréniké Oluwátóyìn Foláyan
Jorma I. Virtanen
author_facet Fardous Hasan
Maha El Tantawi
Farzana Haque
Moréniké Oluwátóyìn Foláyan
Jorma I. Virtanen
author_sort Fardous Hasan
collection DOAJ
description Abstract Background In the last years, artificial intelligence (AI) has contributed to improving healthcare including dentistry. The objective of this study was to develop a machine learning (ML) model for early childhood caries (ECC) prediction by identifying crucial health behaviours within mother-child pairs. Methods For the analysis, we utilized a representative sample of 724 mothers with children under six years in Bangladesh. The study utilized both clinical and survey data. ECC was assessed using ICDAS II criteria in the clinical examinations. Recursive Feature Elimination (RFE) and Random Forest (RF) was applied to identify the optimal subsets of features. Random forest classifier (RFC), extreme gradient boosting (XGBoost), support vector machine (SVM), adaptive boosting (AdaBoost), and multi-layer perceptron (MLP) models were used to identify the best fitted model as the predictor of ECC. SHAP and MDG-MDA plots were visualized for model interpretability and identify significant predictors. Results The RFC model identified 10 features as the most relevant for ECC prediction obtained by RFE feature selection method. The features were: plaque score, age of child, mother’s education, number of siblings, age of mother, consumption of sweet, tooth cleaning tools, child’s tooth brushing frequency, helping child brushing, and use of F-toothpaste. The final ML model achieved an AUC-ROC score (0.77), accuracy (0.72), sensitivity (0.80) and F1 score (0.73) in the test set. Of the prediction model, dental plaque was the strongest predictor of ECC (MDG: 0.08, MDA: 0.10). Conclusions Our final ML model, integrating 10 key features, has the potential to predict ECC effectively in children under five years. Additional research is needed for validation and optimization across various groups.
format Article
id doaj-art-493c4a6272da4976981b1c3fdbf3c2ac
institution Kabale University
issn 1472-6831
language English
publishDate 2025-01-01
publisher BMC
record_format Article
series BMC Oral Health
spelling doaj-art-493c4a6272da4976981b1c3fdbf3c2ac2025-01-12T12:42:09ZengBMCBMC Oral Health1472-68312025-01-0125111110.1186/s12903-025-05419-2Early childhood caries risk prediction using machine learning approaches in BangladeshFardous Hasan0Maha El Tantawi1Farzana Haque2Moréniké Oluwátóyìn Foláyan3Jorma I. Virtanen4Department of Clinical Dentistry, Faculty of Medicine, University of BergenDepartment of Pediatric Dentistry and Dental Public Health, Faculty of Dentistry, Alexandria UniversityDepartment of Clinical Dentistry, Faculty of Medicine, University of BergenEarly Childhood Caries Advocacy Group, University of ManitobaDepartment of Clinical Dentistry, Faculty of Medicine, University of BergenAbstract Background In the last years, artificial intelligence (AI) has contributed to improving healthcare including dentistry. The objective of this study was to develop a machine learning (ML) model for early childhood caries (ECC) prediction by identifying crucial health behaviours within mother-child pairs. Methods For the analysis, we utilized a representative sample of 724 mothers with children under six years in Bangladesh. The study utilized both clinical and survey data. ECC was assessed using ICDAS II criteria in the clinical examinations. Recursive Feature Elimination (RFE) and Random Forest (RF) was applied to identify the optimal subsets of features. Random forest classifier (RFC), extreme gradient boosting (XGBoost), support vector machine (SVM), adaptive boosting (AdaBoost), and multi-layer perceptron (MLP) models were used to identify the best fitted model as the predictor of ECC. SHAP and MDG-MDA plots were visualized for model interpretability and identify significant predictors. Results The RFC model identified 10 features as the most relevant for ECC prediction obtained by RFE feature selection method. The features were: plaque score, age of child, mother’s education, number of siblings, age of mother, consumption of sweet, tooth cleaning tools, child’s tooth brushing frequency, helping child brushing, and use of F-toothpaste. The final ML model achieved an AUC-ROC score (0.77), accuracy (0.72), sensitivity (0.80) and F1 score (0.73) in the test set. Of the prediction model, dental plaque was the strongest predictor of ECC (MDG: 0.08, MDA: 0.10). Conclusions Our final ML model, integrating 10 key features, has the potential to predict ECC effectively in children under five years. Additional research is needed for validation and optimization across various groups.https://doi.org/10.1186/s12903-025-05419-2Dental cariesChildrenArtificial intelligenceMachine learningRisk
spellingShingle Fardous Hasan
Maha El Tantawi
Farzana Haque
Moréniké Oluwátóyìn Foláyan
Jorma I. Virtanen
Early childhood caries risk prediction using machine learning approaches in Bangladesh
BMC Oral Health
Dental caries
Children
Artificial intelligence
Machine learning
Risk
title Early childhood caries risk prediction using machine learning approaches in Bangladesh
title_full Early childhood caries risk prediction using machine learning approaches in Bangladesh
title_fullStr Early childhood caries risk prediction using machine learning approaches in Bangladesh
title_full_unstemmed Early childhood caries risk prediction using machine learning approaches in Bangladesh
title_short Early childhood caries risk prediction using machine learning approaches in Bangladesh
title_sort early childhood caries risk prediction using machine learning approaches in bangladesh
topic Dental caries
Children
Artificial intelligence
Machine learning
Risk
url https://doi.org/10.1186/s12903-025-05419-2
work_keys_str_mv AT fardoushasan earlychildhoodcariesriskpredictionusingmachinelearningapproachesinbangladesh
AT mahaeltantawi earlychildhoodcariesriskpredictionusingmachinelearningapproachesinbangladesh
AT farzanahaque earlychildhoodcariesriskpredictionusingmachinelearningapproachesinbangladesh
AT morenikeoluwatoyinfolayan earlychildhoodcariesriskpredictionusingmachinelearningapproachesinbangladesh
AT jormaivirtanen earlychildhoodcariesriskpredictionusingmachinelearningapproachesinbangladesh