Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)

Aim: This study aimed to delineate spatial variations in under-five mortality across Uttar Pradesh and evaluate the efficacy of various machine learning algorithms in identifying critical determinants influencing these mortality rates. Methods: The study utilized data from the National Family and He...

Full description

Saved in:
Bibliographic Details
Main Authors: Pinky Pandey, Sacheendra Shukla, Niraj Kumar Singh, Mukesh Kumar
Format: Article
Language:English
Published: Elsevier 2025-03-01
Series:Clinical Epidemiology and Global Health
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2213398425000387
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823861210799407104
author Pinky Pandey
Sacheendra Shukla
Niraj Kumar Singh
Mukesh Kumar
author_facet Pinky Pandey
Sacheendra Shukla
Niraj Kumar Singh
Mukesh Kumar
author_sort Pinky Pandey
collection DOAJ
description Aim: This study aimed to delineate spatial variations in under-five mortality across Uttar Pradesh and evaluate the efficacy of various machine learning algorithms in identifying critical determinants influencing these mortality rates. Methods: The study utilized data from the National Family and Health Survey (NFHS) - V. Four machine learning algorithms—Random Forests, Logistic Regression, K-Nearest Neighbors (KNN), and Naive Bayes—were applied alongside a traditional logistic regression model. Predictive performance was evaluated using metrics such as model accuracy and receiver operating characteristic (ROC) curves. Descriptive analysis highlighted regional variations in under-five mortality rates. Results: Notable regional disparities in under-five mortality were observed across Uttar Pradesh. Predictive accuracies ranged from 76 % to 79.4 %, with the logistic regression model achieving the highest accuracy (79.4 %). All ML models demonstrated comparable predictive capabilities. The most effective model identified key determinants of under-five mortality, including breastfeeding status, number of births in the preceding five years, child's gender, birth intervals, antenatal care, birth order, type of water source, and maternal body mass index. Conclusion: Machine learning models provide valuable insights into the determinants of under-five mortality, with the logistic regression model demonstrating superior predictive performance. Policy measures targeting critical factors, such as promoting breastfeeding, optimizing birth intervals, and improving maternal health and antenatal care, can significantly enhance childhood survival rates in Uttar Pradesh.
format Article
id doaj-art-c4a67da46b8c456eb125d27cecd4ea01
institution Kabale University
issn 2213-3984
language English
publishDate 2025-03-01
publisher Elsevier
record_format Article
series Clinical Epidemiology and Global Health
spelling doaj-art-c4a67da46b8c456eb125d27cecd4ea012025-02-10T04:34:22ZengElsevierClinical Epidemiology and Global Health2213-39842025-03-0132101949Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)Pinky Pandey0Sacheendra Shukla1Niraj Kumar Singh2Mukesh Kumar3Department of Statistics, Amity Institute of Applied Sciences, AUUP, India; Corresponding author.Department of Statistics, Amity Institute of Applied Sciences, AUUP, IndiaDepartment of Statistics, Amity Institute of Applied Sciences, AUUP, IndiaDepartment of Statistics, Lucknow University, IndiaAim: This study aimed to delineate spatial variations in under-five mortality across Uttar Pradesh and evaluate the efficacy of various machine learning algorithms in identifying critical determinants influencing these mortality rates. Methods: The study utilized data from the National Family and Health Survey (NFHS) - V. Four machine learning algorithms—Random Forests, Logistic Regression, K-Nearest Neighbors (KNN), and Naive Bayes—were applied alongside a traditional logistic regression model. Predictive performance was evaluated using metrics such as model accuracy and receiver operating characteristic (ROC) curves. Descriptive analysis highlighted regional variations in under-five mortality rates. Results: Notable regional disparities in under-five mortality were observed across Uttar Pradesh. Predictive accuracies ranged from 76 % to 79.4 %, with the logistic regression model achieving the highest accuracy (79.4 %). All ML models demonstrated comparable predictive capabilities. The most effective model identified key determinants of under-five mortality, including breastfeeding status, number of births in the preceding five years, child's gender, birth intervals, antenatal care, birth order, type of water source, and maternal body mass index. Conclusion: Machine learning models provide valuable insights into the determinants of under-five mortality, with the logistic regression model demonstrating superior predictive performance. Policy measures targeting critical factors, such as promoting breastfeeding, optimizing birth intervals, and improving maternal health and antenatal care, can significantly enhance childhood survival rates in Uttar Pradesh.http://www.sciencedirect.com/science/article/pii/S2213398425000387Under-five mortalityMachine LearningKNNRandom ForestROC curve
spellingShingle Pinky Pandey
Sacheendra Shukla
Niraj Kumar Singh
Mukesh Kumar
Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)
Clinical Epidemiology and Global Health
Under-five mortality
Machine Learning
KNN
Random Forest
ROC curve
title Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)
title_full Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)
title_fullStr Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)
title_full_unstemmed Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)
title_short Predicting child mortality determinants in Uttar Pradesh using Machine Learning: Insights from the National Family and Health Survey (2019–21)
title_sort predicting child mortality determinants in uttar pradesh using machine learning insights from the national family and health survey 2019 21
topic Under-five mortality
Machine Learning
KNN
Random Forest
ROC curve
url http://www.sciencedirect.com/science/article/pii/S2213398425000387
work_keys_str_mv AT pinkypandey predictingchildmortalitydeterminantsinuttarpradeshusingmachinelearninginsightsfromthenationalfamilyandhealthsurvey201921
AT sacheendrashukla predictingchildmortalitydeterminantsinuttarpradeshusingmachinelearninginsightsfromthenationalfamilyandhealthsurvey201921
AT nirajkumarsingh predictingchildmortalitydeterminantsinuttarpradeshusingmachinelearninginsightsfromthenationalfamilyandhealthsurvey201921
AT mukeshkumar predictingchildmortalitydeterminantsinuttarpradeshusingmachinelearninginsightsfromthenationalfamilyandhealthsurvey201921