Development and validation of predictive models for diabetic retinopathy using machine learning.

<h4>Objective</h4>This study aimed to develop and compare machine learning models for predicting diabetic retinopathy (DR) using clinical and biochemical data, specifically logistic regression, random forest, XGBoost, and neural networks.<h4>Methods</h4>A dataset of 3,000 dia...

Full description

Saved in:
Bibliographic Details
Main Authors: Penglu Yang, Bin Yang
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0318226
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:<h4>Objective</h4>This study aimed to develop and compare machine learning models for predicting diabetic retinopathy (DR) using clinical and biochemical data, specifically logistic regression, random forest, XGBoost, and neural networks.<h4>Methods</h4>A dataset of 3,000 diabetic patients, including 1,500 with DR, was obtained from the National Population Health Science Data Center. Significant predictors were identified, and four predictive models were developed. Model performance was assessed using accuracy, precision, recall, F1-score, and area under the curve (AUC).<h4>Results</h4>Random forest and XGBoost demonstrated superior performance, achieving accuracies of 95.67% and 94.67%, respectively, with AUC values of 0.991 and 0.989. Logistic regression yielded an accuracy of 76.50% (AUC: 0.828), while neural networks achieved 82.67% accuracy (AUC: 0.927). Key predictors included 24-hour urinary microalbumin, HbA1c, and serum creatinine.<h4>Conclusion</h4>The study highlights random forest and XGBoost as effective tools for early DR detection, emphasizing the importance of renal and glycemic markers in risk assessment. These findings support the integration of machine learning models into clinical decision-making for improved patient outcomes in diabetes management.
ISSN:1932-6203