Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort study

Abstract Background Older adults with chronic diseases are at higher risk of depressive symptoms than those without. For the onset of depressive symptoms, the prediction ability of changes in common risk factors over a 2-year follow-up period is unclear in the Chinese older population. This study ai...

Full description

Saved in:
Bibliographic Details
Main Authors: Ying Zheng, Taotao Zhang, Shu Yang, Fuzhi Wang, Li Zhang, Yuwen Liu
Format: Article
Language:English
Published: BMC 2024-12-01
Series:BMC Psychiatry
Subjects:
Online Access:https://doi.org/10.1186/s12888-024-06299-6
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849220647507460096
author Ying Zheng
Taotao Zhang
Shu Yang
Fuzhi Wang
Li Zhang
Yuwen Liu
author_facet Ying Zheng
Taotao Zhang
Shu Yang
Fuzhi Wang
Li Zhang
Yuwen Liu
author_sort Ying Zheng
collection DOAJ
description Abstract Background Older adults with chronic diseases are at higher risk of depressive symptoms than those without. For the onset of depressive symptoms, the prediction ability of changes in common risk factors over a 2-year follow-up period is unclear in the Chinese older population. This study aimed to build risk prediction models (RPMs) to estimate the probability of incident 2-year depression using data from the China Health and Retirement Longitudinal Study (CHARLS). Methods Four ML algorithms (logistic regression [LR], AdaBoost, random forest [RF] and k-nearest neighbor [kNN]) were applied to develop RPMs using the 2011–2015 cohort data. These developed models were then validated with 2018–2020 survey data. We evaluated the model performance using discrimination and calibration metrics, including an area under the receiver operating characteristic curve (AUROC) and the precision-recall curve (AUPRC), accuracy, sensitivity and calibrations plot. Finally, we explored the key factors of depressive symptoms by the selected best predictive models. Results This study finally included 7,121 participants to build models to predict depressive symptoms, finding a 21.5% prevalence of depression. Combining the Synthetic Minority Oversampling Technique (SMOTE) with the logistic regression model (LR-SM) exhibited superior precision to predict depression than other models, with an AUROC and AUPRC of 0.612 and 0.468, respectively, an accuracy of 0.619 and a sensitivity of 0.546. In additiona, external validation of the LR-SM model using data from the 2018–2020 data also demonstrated good predictive ability with an AUROC of 0.623 (95% CI: 0.555– 0.673). Sex, self-rated health status, occupation, eyesight, memory and life satisfaction were identified as impactful predictors of depression. Conclusions Our developed models exhibited high accuracy, good discrimination and calibration profiles in predicting two-year risk of depression among older adults with chronic diseases. This model can be used to identify Chinese older population at high risk of depression and intervene in a timely manner.
format Article
id doaj-art-4f2d6a6cfb6040ad91bf2d817cb863fa
institution Kabale University
issn 1471-244X
language English
publishDate 2024-12-01
publisher BMC
record_format Article
series BMC Psychiatry
spelling doaj-art-4f2d6a6cfb6040ad91bf2d817cb863fa2024-12-08T12:39:42ZengBMCBMC Psychiatry1471-244X2024-12-0124111110.1186/s12888-024-06299-6Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort studyYing Zheng0Taotao Zhang1Shu Yang2Fuzhi Wang3Li Zhang4Yuwen Liu5School of Nursing, Bengbu Medical UniversitySchool of Nursing, Bengbu Medical UniversitySchool of Health Management, Bengbu Medical UniversitySchool of Health Management, Bengbu Medical UniversitySchool of Nursing, Bengbu Medical UniversitySchool of Health Management, Bengbu Medical UniversityAbstract Background Older adults with chronic diseases are at higher risk of depressive symptoms than those without. For the onset of depressive symptoms, the prediction ability of changes in common risk factors over a 2-year follow-up period is unclear in the Chinese older population. This study aimed to build risk prediction models (RPMs) to estimate the probability of incident 2-year depression using data from the China Health and Retirement Longitudinal Study (CHARLS). Methods Four ML algorithms (logistic regression [LR], AdaBoost, random forest [RF] and k-nearest neighbor [kNN]) were applied to develop RPMs using the 2011–2015 cohort data. These developed models were then validated with 2018–2020 survey data. We evaluated the model performance using discrimination and calibration metrics, including an area under the receiver operating characteristic curve (AUROC) and the precision-recall curve (AUPRC), accuracy, sensitivity and calibrations plot. Finally, we explored the key factors of depressive symptoms by the selected best predictive models. Results This study finally included 7,121 participants to build models to predict depressive symptoms, finding a 21.5% prevalence of depression. Combining the Synthetic Minority Oversampling Technique (SMOTE) with the logistic regression model (LR-SM) exhibited superior precision to predict depression than other models, with an AUROC and AUPRC of 0.612 and 0.468, respectively, an accuracy of 0.619 and a sensitivity of 0.546. In additiona, external validation of the LR-SM model using data from the 2018–2020 data also demonstrated good predictive ability with an AUROC of 0.623 (95% CI: 0.555– 0.673). Sex, self-rated health status, occupation, eyesight, memory and life satisfaction were identified as impactful predictors of depression. Conclusions Our developed models exhibited high accuracy, good discrimination and calibration profiles in predicting two-year risk of depression among older adults with chronic diseases. This model can be used to identify Chinese older population at high risk of depression and intervene in a timely manner.https://doi.org/10.1186/s12888-024-06299-6Chronic diseasesRisk factorsMachine learningDepressive symptoms
spellingShingle Ying Zheng
Taotao Zhang
Shu Yang
Fuzhi Wang
Li Zhang
Yuwen Liu
Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort study
BMC Psychiatry
Chronic diseases
Risk factors
Machine learning
Depressive symptoms
title Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort study
title_full Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort study
title_fullStr Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort study
title_full_unstemmed Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort study
title_short Using machine learning to predict the probability of incident 2-year depression in older adults with chronic diseases: a retrospective cohort study
title_sort using machine learning to predict the probability of incident 2 year depression in older adults with chronic diseases a retrospective cohort study
topic Chronic diseases
Risk factors
Machine learning
Depressive symptoms
url https://doi.org/10.1186/s12888-024-06299-6
work_keys_str_mv AT yingzheng usingmachinelearningtopredicttheprobabilityofincident2yeardepressioninolderadultswithchronicdiseasesaretrospectivecohortstudy
AT taotaozhang usingmachinelearningtopredicttheprobabilityofincident2yeardepressioninolderadultswithchronicdiseasesaretrospectivecohortstudy
AT shuyang usingmachinelearningtopredicttheprobabilityofincident2yeardepressioninolderadultswithchronicdiseasesaretrospectivecohortstudy
AT fuzhiwang usingmachinelearningtopredicttheprobabilityofincident2yeardepressioninolderadultswithchronicdiseasesaretrospectivecohortstudy
AT lizhang usingmachinelearningtopredicttheprobabilityofincident2yeardepressioninolderadultswithchronicdiseasesaretrospectivecohortstudy
AT yuwenliu usingmachinelearningtopredicttheprobabilityofincident2yeardepressioninolderadultswithchronicdiseasesaretrospectivecohortstudy