Risk Factors for Gout in Taiwan Biobank: A Machine Learning Approach

Yu-Ruey Liu,1– 3 Oswald Ndi Nfor,4 Ji-Han Zhong,4 Chun-Yuan Lin,3 Yung-Po Liaw4– 6 1College of Information and Electrical Engineering, Asia University, Taichung, 413, Taiwan; 2Department of Emergency Medicine, Cheng Ching General Hospital, Taichung, Taiwan; 3Department of Computer Science and Inform...

Full description

Saved in:
Bibliographic Details
Main Authors: Liu YR, Nfor ON, Zhong JH, Lin CY, Liaw YP
Format: Article
Language:English
Published: Dove Medical Press 2024-11-01
Series:Journal of Inflammation Research
Subjects:
Online Access:https://www.dovepress.com/risk-factors-for-gout-in-taiwan-biobank-a-machine-learning-approach-peer-reviewed-fulltext-article-JIR
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850217405592109056
author Liu YR
Nfor ON
Zhong JH
Lin CY
Liaw YP
author_facet Liu YR
Nfor ON
Zhong JH
Lin CY
Liaw YP
author_sort Liu YR
collection DOAJ
description Yu-Ruey Liu,1– 3 Oswald Ndi Nfor,4 Ji-Han Zhong,4 Chun-Yuan Lin,3 Yung-Po Liaw4– 6 1College of Information and Electrical Engineering, Asia University, Taichung, 413, Taiwan; 2Department of Emergency Medicine, Cheng Ching General Hospital, Taichung, Taiwan; 3Department of Computer Science and Information Engineering, Asia University, Taichung, 413, Taiwan; 4Department of Public Health and Institute of Public Health, Chung Shan Medical University, Taichung, Taiwan; 5Institute of Medicine, Chung Shan Medical University, Taichung, Taiwan; 6Department of Medical Imaging, Chung Shan Medical University Hospital, Taichung, TaiwanCorrespondence: Yung-Po Liaw, Department of Public Health and Institute of Public Health, Chung Shan Medical University, No. 110, Sec. 1 Jianguo N. Road, Taichung, 40201, Taiwan, Tel +886-4-36097501, Email Liawyp@csmu.edu.tw Chun-Yuan Lin, Department of Computer Science and Information Engineering, Asia University, No. 500, Lioufeng Road, Wufeng, Taichung, 413, Taiwan, Tel +886-4-2332-3456 # 1814, Email cyulin@asia.edu.twPurpose: We assessed the risk of gout in the Taiwan Biobank population by applying various machine learning algorithms. The study aimed to identify crucial risk factors and evaluate the performance of different models in gout prediction.Patients and Methods: This study analyzed data from 88,210 individuals in the Taiwan Biobank, identifying 19,338 cases of gout and 68,872 controls. After data cleaning and propensity score matching for gender and age, the final analytical sample comprised 38,676 individuals (19,338 gout cases and 19,338 controls). Five machine learning models were used: Bayesian Network (BN), Random Forest (RF), Gradient Boosting (GB), Logistic Regression (LR), and Neural Network (NN). The predictive performance was evaluated using a split dataset (80% training set and 20% test set).Results: Variable importance analysis was performed to identify key variables, with uric acid and gender emerging as the most influential risk factors. Descriptive data highlighted significant differences between the control group and gout patients, with a higher prevalence of gout in men (51.36% vs 48.64%). Both the RF and GB demonstrated high performance across multiple metrics, with RF consistently achieving a high area under the curve (AUC) of 0.986 to 0.987, alongside excellent sensitivity (0.945– 0.947) and specificity (0.998– 0.999). GB also performed robustly, with AUC values around 0.987– 0.988 and maintaining high sensitivity (0.944– 0.950) and specificity (0.995– 0.999) across different model variations. The F1 scores for both models (GB and RF) indicate strong predictive capabilities, with values around 0.971– 0.972.Conclusion: The RF and GB demonstrated exceptional accuracy in predicting gout status, particularly when incorporating genetic data alongside clinical factors. These findings underscore the potential for integrating machine learning models with genetic information to enhance gout prediction accuracy in clinical practice.Keywords: risk prediction, gout, machine learning, artificial intelligence
format Article
id doaj-art-8079a3db444e434782eb985cf69af7fc
institution OA Journals
issn 1178-7031
language English
publishDate 2024-11-01
publisher Dove Medical Press
record_format Article
series Journal of Inflammation Research
spelling doaj-art-8079a3db444e434782eb985cf69af7fc2025-08-20T02:08:03ZengDove Medical PressJournal of Inflammation Research1178-70312024-11-01Volume 179847985697733Risk Factors for Gout in Taiwan Biobank: A Machine Learning ApproachLiu YRNfor ONZhong JHLin CYLiaw YPYu-Ruey Liu,1– 3 Oswald Ndi Nfor,4 Ji-Han Zhong,4 Chun-Yuan Lin,3 Yung-Po Liaw4– 6 1College of Information and Electrical Engineering, Asia University, Taichung, 413, Taiwan; 2Department of Emergency Medicine, Cheng Ching General Hospital, Taichung, Taiwan; 3Department of Computer Science and Information Engineering, Asia University, Taichung, 413, Taiwan; 4Department of Public Health and Institute of Public Health, Chung Shan Medical University, Taichung, Taiwan; 5Institute of Medicine, Chung Shan Medical University, Taichung, Taiwan; 6Department of Medical Imaging, Chung Shan Medical University Hospital, Taichung, TaiwanCorrespondence: Yung-Po Liaw, Department of Public Health and Institute of Public Health, Chung Shan Medical University, No. 110, Sec. 1 Jianguo N. Road, Taichung, 40201, Taiwan, Tel +886-4-36097501, Email Liawyp@csmu.edu.tw Chun-Yuan Lin, Department of Computer Science and Information Engineering, Asia University, No. 500, Lioufeng Road, Wufeng, Taichung, 413, Taiwan, Tel +886-4-2332-3456 # 1814, Email cyulin@asia.edu.twPurpose: We assessed the risk of gout in the Taiwan Biobank population by applying various machine learning algorithms. The study aimed to identify crucial risk factors and evaluate the performance of different models in gout prediction.Patients and Methods: This study analyzed data from 88,210 individuals in the Taiwan Biobank, identifying 19,338 cases of gout and 68,872 controls. After data cleaning and propensity score matching for gender and age, the final analytical sample comprised 38,676 individuals (19,338 gout cases and 19,338 controls). Five machine learning models were used: Bayesian Network (BN), Random Forest (RF), Gradient Boosting (GB), Logistic Regression (LR), and Neural Network (NN). The predictive performance was evaluated using a split dataset (80% training set and 20% test set).Results: Variable importance analysis was performed to identify key variables, with uric acid and gender emerging as the most influential risk factors. Descriptive data highlighted significant differences between the control group and gout patients, with a higher prevalence of gout in men (51.36% vs 48.64%). Both the RF and GB demonstrated high performance across multiple metrics, with RF consistently achieving a high area under the curve (AUC) of 0.986 to 0.987, alongside excellent sensitivity (0.945– 0.947) and specificity (0.998– 0.999). GB also performed robustly, with AUC values around 0.987– 0.988 and maintaining high sensitivity (0.944– 0.950) and specificity (0.995– 0.999) across different model variations. The F1 scores for both models (GB and RF) indicate strong predictive capabilities, with values around 0.971– 0.972.Conclusion: The RF and GB demonstrated exceptional accuracy in predicting gout status, particularly when incorporating genetic data alongside clinical factors. These findings underscore the potential for integrating machine learning models with genetic information to enhance gout prediction accuracy in clinical practice.Keywords: risk prediction, gout, machine learning, artificial intelligencehttps://www.dovepress.com/risk-factors-for-gout-in-taiwan-biobank-a-machine-learning-approach-peer-reviewed-fulltext-article-JIRrisk predictiongoutmachine learningartificial intelligence
spellingShingle Liu YR
Nfor ON
Zhong JH
Lin CY
Liaw YP
Risk Factors for Gout in Taiwan Biobank: A Machine Learning Approach
Journal of Inflammation Research
risk prediction
gout
machine learning
artificial intelligence
title Risk Factors for Gout in Taiwan Biobank: A Machine Learning Approach
title_full Risk Factors for Gout in Taiwan Biobank: A Machine Learning Approach
title_fullStr Risk Factors for Gout in Taiwan Biobank: A Machine Learning Approach
title_full_unstemmed Risk Factors for Gout in Taiwan Biobank: A Machine Learning Approach
title_short Risk Factors for Gout in Taiwan Biobank: A Machine Learning Approach
title_sort risk factors for gout in taiwan biobank a machine learning approach
topic risk prediction
gout
machine learning
artificial intelligence
url https://www.dovepress.com/risk-factors-for-gout-in-taiwan-biobank-a-machine-learning-approach-peer-reviewed-fulltext-article-JIR
work_keys_str_mv AT liuyr riskfactorsforgoutintaiwanbiobankamachinelearningapproach
AT nforon riskfactorsforgoutintaiwanbiobankamachinelearningapproach
AT zhongjh riskfactorsforgoutintaiwanbiobankamachinelearningapproach
AT lincy riskfactorsforgoutintaiwanbiobankamachinelearningapproach
AT liawyp riskfactorsforgoutintaiwanbiobankamachinelearningapproach