Type 2 diabetes risk prediction using glycemic control Metrics: A machine learning approach
Background: Type 2 Diabetes Mellitus (T2DM) remains a significant global health burden, particularly in low- and middle-income settings. Conventional prevention strategies often lack personalization, overlooking individual variability in lifestyle, nutrition, and health status. This study aimed to d...
Saved in:
| Main Authors: | , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2025-12-01
|
| Series: | Human Nutrition & Metabolism |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2666149725000453 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Background: Type 2 Diabetes Mellitus (T2DM) remains a significant global health burden, particularly in low- and middle-income settings. Conventional prevention strategies often lack personalization, overlooking individual variability in lifestyle, nutrition, and health status. This study aimed to develop a personalized T2DM risk prediction model using machine learning (ML), integrating clinical, behavioral, and dietary data, including glycemic index (GI) and glycemic load (GL) derived from actual food and recipe intake. Methods: Data from 3145 Palestinian adults (aged 18–60) were analyzed using statistical and machine learning (ML) techniques. Variables included age, sex, education, income, physical activity, smoking status, perceived health, and detailed nutritional intake, specifically glycemic index (GI) and glycemic load (GL). Nine ML models were developed using the AutoGluon-Tabular framework. Model performance was assessed via accuracy, area under the curve (AUC), and log loss. Feature importance analysis identified key predictors of T2DM risk. Results: Women had significantly higher odds of diabetes than men, while rural residents had a lower risk compared to urban dwellers. People aged 50–59 were over six times more likely to be diabetic than those aged 18–29. Lower education and poor perceived health were also strong predictors. Diabetic participants consumed significantly lower GI (87.7 ± 36.1) and GL (241 ± 180.5) diets compared to non-diabetics (GI = 98.8 ± 35.5; GL = 303.3 ± 202.7; p = 0.001). Among the ML models, XGBoost and CatBoost performed best, with over 93 % accuracy and excellent prediction scores. Glycemic load, age, BMI, waist-to-hip ratio, and self-reported health status were the most important risk indicators. Conclusion: This study showed the effectiveness of integrating machine learning with glycemic control metrics and lifestyle data for personalized T2DM prediction. Incorporating glycemic values from real food and recipe intake improved model accuracy and interpretability. These findings support the development of precision prevention strategies tailored to individual risk profiles, particularly in underserved populations. |
|---|---|
| ISSN: | 2666-1497 |