Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis
Abstract Diabetes Mellitus is a chronic metabolic disorder affecting a substantial global population leading to complications such as retinopathy, nephropathy, neuropathy, foot problems, heart attacks, and strokes if left unchecked. Prompt detection and diagnosis are crucial in managing and averting...
Saved in:
| Main Authors: | , , , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-06-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-03718-w |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849434226346164224 |
|---|---|
| author | Idris Zubairu Sadiq Babangida Sanusi Katsayal Bashiru Ibrahim Maryam Ibrahim Hassan Aliyu Hassan Umar Muhammad Ghali Abdullahi Garba Usman Abubakar Usman Sani Isah Abba |
| author_facet | Idris Zubairu Sadiq Babangida Sanusi Katsayal Bashiru Ibrahim Maryam Ibrahim Hassan Aliyu Hassan Umar Muhammad Ghali Abdullahi Garba Usman Abubakar Usman Sani Isah Abba |
| author_sort | Idris Zubairu Sadiq |
| collection | DOAJ |
| description | Abstract Diabetes Mellitus is a chronic metabolic disorder affecting a substantial global population leading to complications such as retinopathy, nephropathy, neuropathy, foot problems, heart attacks, and strokes if left unchecked. Prompt detection and diagnosis are crucial in managing and averting these complications. This study compares the effectiveness of a Decision Tree Classifier and an Artificial Neural Network (ANN) in predicting Diabetes Mellitus. The Decision Tree Classifier demonstrated superior performance, achieving a 97.7% accuracy rate compared to the ANN’s 94.7%. The Decision Tree Classifier also achieved higher precision (96.9% vs. 88.8%) and recall (96.5% vs. 90.2%) than the ANN, along with a balanced F1 score of 96.5% versus 90.2%. The Matthews Correlation Coefficient (MCC) confirmed a stronger correlation between predictions and actual labels for the Decision Tree Classifier (87.4%) compared to the ANN (78%). Furthermore, the Area Under Curve (AUC) score of 96% for the Decision Tree Classifier was higher than that of ANN (78%). The relative importance feature analysis clearly established glycated hemoglobin (HbA1c) as the paramount factor in predicting diabetes mellitus. Diabetic patients showed markedly higher cholesterol and triglycerides, increasing cardiovascular risk, while High Density Lipoprotein (HDL) and Low-Density Lipoprotein (LDL) levels showed no significant difference between diabetics and non-diabetics. However, Very Low-Density Lipoprotein (VLDL) was significantly elevated, suggesting altered lipid transport in diabetes. Body Mass Index (BMI) was also notably higher in diabetics, reinforcing the link between obesity and diabetes risk. Principal Component analysis further highlighted five clusters of health-related variables, identifying age-related metabolic indicators (AGE, HbA1c, BMI), kidney function markers (creatinine (Cr), Urea), cardiovascular lipid profiles (Cholesterol, LDL), lipid transport (VLDL), and protective cardiovascular indicator (HDL). The study highlights the superiority of decision tree classifier in predicting Diabetes Mellitus, suggesting its potential for significant clinical applications in diagnosis and management. |
| format | Article |
| id | doaj-art-d91f5a71f4294eb5b7336774cea4e295 |
| institution | Kabale University |
| issn | 2045-2322 |
| language | English |
| publishDate | 2025-06-01 |
| publisher | Nature Portfolio |
| record_format | Article |
| series | Scientific Reports |
| spelling | doaj-art-d91f5a71f4294eb5b7336774cea4e2952025-08-20T03:26:44ZengNature PortfolioScientific Reports2045-23222025-06-0115111610.1038/s41598-025-03718-wData-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysisIdris Zubairu Sadiq0Babangida Sanusi Katsayal1Bashiru Ibrahim2Maryam Ibrahim3Hassan Aliyu Hassan4Umar Muhammad Ghali5Abdullahi Garba Usman6Abubakar Usman7Sani Isah Abba8Department of Biochemistry, Faculty of Life Sciences, Ahmadu Bello UniversityDepartment of Biochemistry, Faculty of Life Sciences, Ahmadu Bello UniversityDepartment of Biochemistry, Faculty of Life Sciences, Ahmadu Bello UniversityDepartment of Biochemistry and Molecular Biology, Faculty of Life Sciences, Federal UniversityDepartment of Biochemistry, Federal UniversityDepartment of Chemistry, Faculty of Science, Cankiri Karatekin UniversityOperational Research Centre in Healthcare, Near East UniversityDepartment of Statistics, Faculty of Physical Sciences, Ahmadu Bello UniversityDepartment of Chemical Engineering, Prince Mohammad Bin Fahd UniversityAbstract Diabetes Mellitus is a chronic metabolic disorder affecting a substantial global population leading to complications such as retinopathy, nephropathy, neuropathy, foot problems, heart attacks, and strokes if left unchecked. Prompt detection and diagnosis are crucial in managing and averting these complications. This study compares the effectiveness of a Decision Tree Classifier and an Artificial Neural Network (ANN) in predicting Diabetes Mellitus. The Decision Tree Classifier demonstrated superior performance, achieving a 97.7% accuracy rate compared to the ANN’s 94.7%. The Decision Tree Classifier also achieved higher precision (96.9% vs. 88.8%) and recall (96.5% vs. 90.2%) than the ANN, along with a balanced F1 score of 96.5% versus 90.2%. The Matthews Correlation Coefficient (MCC) confirmed a stronger correlation between predictions and actual labels for the Decision Tree Classifier (87.4%) compared to the ANN (78%). Furthermore, the Area Under Curve (AUC) score of 96% for the Decision Tree Classifier was higher than that of ANN (78%). The relative importance feature analysis clearly established glycated hemoglobin (HbA1c) as the paramount factor in predicting diabetes mellitus. Diabetic patients showed markedly higher cholesterol and triglycerides, increasing cardiovascular risk, while High Density Lipoprotein (HDL) and Low-Density Lipoprotein (LDL) levels showed no significant difference between diabetics and non-diabetics. However, Very Low-Density Lipoprotein (VLDL) was significantly elevated, suggesting altered lipid transport in diabetes. Body Mass Index (BMI) was also notably higher in diabetics, reinforcing the link between obesity and diabetes risk. Principal Component analysis further highlighted five clusters of health-related variables, identifying age-related metabolic indicators (AGE, HbA1c, BMI), kidney function markers (creatinine (Cr), Urea), cardiovascular lipid profiles (Cholesterol, LDL), lipid transport (VLDL), and protective cardiovascular indicator (HDL). The study highlights the superiority of decision tree classifier in predicting Diabetes Mellitus, suggesting its potential for significant clinical applications in diagnosis and management.https://doi.org/10.1038/s41598-025-03718-wDiabetes mellitusDecision tree classifierArtificial neural networkMachine learningModellingPrediction |
| spellingShingle | Idris Zubairu Sadiq Babangida Sanusi Katsayal Bashiru Ibrahim Maryam Ibrahim Hassan Aliyu Hassan Umar Muhammad Ghali Abdullahi Garba Usman Abubakar Usman Sani Isah Abba Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis Scientific Reports Diabetes mellitus Decision tree classifier Artificial neural network Machine learning Modelling Prediction |
| title | Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis |
| title_full | Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis |
| title_fullStr | Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis |
| title_full_unstemmed | Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis |
| title_short | Data-driven diabetes mellitus prediction and management: a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis |
| title_sort | data driven diabetes mellitus prediction and management a comparative evaluation of decision tree classifier and artificial neural network models along with statistical analysis |
| topic | Diabetes mellitus Decision tree classifier Artificial neural network Machine learning Modelling Prediction |
| url | https://doi.org/10.1038/s41598-025-03718-w |
| work_keys_str_mv | AT idriszubairusadiq datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT babangidasanusikatsayal datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT bashiruibrahim datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT maryamibrahim datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT hassanaliyuhassan datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT umarmuhammadghali datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT abdullahigarbausman datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT abubakarusman datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis AT saniisahabba datadrivendiabetesmellituspredictionandmanagementacomparativeevaluationofdecisiontreeclassifierandartificialneuralnetworkmodelsalongwithstatisticalanalysis |