Explainable machine learning-driven models for predicting Parkinson’s disease and its prognosis: obesity patterns associations and models development using NHANES 1999–2018 data
Abstract Background Parkinson's disease (PD) is a prevalent neurodegenerative condition, the effect of obesity on PD remains controversial. We aimed to investigate the associations of obesity patterns on PD and all-cause mortality, while developing machine learning (ML)-driven predictive and pr...
Saved in:
| Main Authors: | , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
BMC
2025-07-01
|
| Series: | Lipids in Health and Disease |
| Subjects: | |
| Online Access: | https://doi.org/10.1186/s12944-025-02664-w |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| Summary: | Abstract Background Parkinson's disease (PD) is a prevalent neurodegenerative condition, the effect of obesity on PD remains controversial. We aimed to investigate the associations of obesity patterns on PD and all-cause mortality, while developing machine learning (ML)-driven predictive and prognostic models for PD. Methods Fifty-one thousand, three hundred ninety-four adults from the National Health and Nutrition Examination Survey (NHANES) 1999–2018 were classified into four obesity patterns via body mass index (BMI) and waist circumference (WC). Associations of obesity patterns with PD risk and all-cause mortality were evaluated via multivariable logistic and Cox proportional hazards regression across three adjusted models. Subgroup, sensitivity, and restricted cubic spline (RCS) analyses examined stability, robustness, and nonlinearity. An integrative ML-driven architecture identified key features to develop predictive and prognostic nomograms, validated by the area under the receiver operating characteristic curves (AUCROCs) and calibration curves. Survival differences were analyzed using Kaplan–Meier curves. Shapley additive explanations (SHAP) enhanced model explanation. Results Compound obesity significantly increased PD risk (Model 1: OR = 1.83, P < 0.001; Model 2: OR = 1.70, P = 0.002; Model 3: OR = 1.71, P = 0.006) yet correlated with reduced all-cause mortality in PD patients (Model 1: HR = 0.43, P = 0.003; Model 2: HR = 0.75, P = 0.428; Model 3: HR = 0.41, P = 0.033). Subgroup analysis revealed only HbA1c-modified association of compound obesity with PD (P interaction = 0.031). Sensitivity analyses confirmed robustness (pooled OR = 1.83, P < 0.001; pooled HR = 0.43, P = 0.003). RCS analyses revealed BMI-dependent PD risk escalation (P nonlinearity = 0.008, BMI < 45.0 kg/m2), inverted U-shaped WC-PD link (P nonlinearity < 0.001), and inverse dose–response BMI-mortality relationship (P nonlinearity = 0.003), along with multiphasic WC-mortality association (P Threshold = 0.555 at 95 cm and P Threshold = 0.091 at 118 cm). LASSO + RF identified eight features, achieving moderate performance in PD prediction (SMOTE set: AUCROC = 0.75, Brier = 0.20) and prognosis (train set: AUCROC = 0.72, Brier = 0.22) nomograms, with similar results in the test set (AUCROC = 0.70, Brier = 0.01 for prediction, 0.87 and 0.18 for prognosis). No 24-month survival differences were observed across four obesity patterns (train set: P log-rank = 0.73; test set: P log-rank = 0.32). Conclusions This study preliminarily reveals that compound obesity significantly increases PD risk yet paradoxically associates with reduced all-cause mortality in PD patients. Validated predictive and prognostic nomograms for PD achieve relatively robust performances. Nonetheless, extensive longitudinal studies are required to validate these exploratory findings more comprehensively. |
|---|---|
| ISSN: | 1476-511X |