Prediction of respiratory diseases based on random forest model

In recent years, the random forest model has been widely applied to analyze the relationships among air pollution, meteorological factors, and human health. To investigate the patterns and influencing factors of respiratory disease-related medical visits, this study utilized data on medical visits f...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaotong Yang, Yi Li, Lang Liu, Zengliang Zang
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-02-01
Series:Frontiers in Public Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpubh.2025.1537238/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849721750176137216
author Xiaotong Yang
Xiaotong Yang
Yi Li
Yi Li
Lang Liu
Lang Liu
Zengliang Zang
Zengliang Zang
author_facet Xiaotong Yang
Xiaotong Yang
Yi Li
Yi Li
Lang Liu
Lang Liu
Zengliang Zang
Zengliang Zang
author_sort Xiaotong Yang
collection DOAJ
description In recent years, the random forest model has been widely applied to analyze the relationships among air pollution, meteorological factors, and human health. To investigate the patterns and influencing factors of respiratory disease-related medical visits, this study utilized data on medical visits from urban areas of Tianjin, meteorological observations, and pollution data. First, the temporal variation characteristics of medical visits from 2013 to 2019 were analyzed. Subsequently, the random forest model was employed to identify the dominant influencing factors of respiratory disease-related medical visits and to construct a statistical forecasting model that relates these factors to the number of visits. Additionally, a predictive analysis of medical visits in Tianjin for the year 2019 was conducted. The results indicate the following: (1) From 2013 to 2019, the number of medical visits exhibited seasonal fluctuations, with a significant decline observed in 2017, which may be directly related to adjustments in hospital policies. (2) Among the meteorological factors, average temperature, relative humidity, precipitation, and ozone concentration significantly influenced the variation in medical visits, while wind speed, precipitation amount, and boundary layer height were of lesser importance. Furthermore, different linear relationships exist among the meteorological factors; specifically, meteorological factors show a negative correlation with pollutant elements, and there is a strong correlation among the pollutant factors. (3) When the number of medical visits ranged from 50 to 200, the predictions made by the random forest model closely matched the actual values, demonstrating strong predictive performance and the ability to effectively forecast daily variations in medical visits over extended periods, thus exhibiting good stability and generalization capability. (4) However, since the random forest model relies on a large amount of data for model validation, it has limitations in capturing extreme variations in medical visit numbers. Future research could address this issue by integrating different models to enhance predictive capabilities.
format Article
id doaj-art-3f205e664bf34b7d8463b0af69b57eaf
institution DOAJ
issn 2296-2565
language English
publishDate 2025-02-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Public Health
spelling doaj-art-3f205e664bf34b7d8463b0af69b57eaf2025-08-20T03:11:33ZengFrontiers Media S.A.Frontiers in Public Health2296-25652025-02-011310.3389/fpubh.2025.15372381537238Prediction of respiratory diseases based on random forest modelXiaotong Yang0Xiaotong Yang1Yi Li2Yi Li3Lang Liu4Lang Liu5Zengliang Zang6Zengliang Zang7College of Meteorology and Oceanography, National University of Defense Technology, Changsha, ChinaKey Laboratory of High Impact Weather (Special), China Meteorological Administration, Changsha, ChinaCollege of Meteorology and Oceanography, National University of Defense Technology, Changsha, ChinaKey Laboratory of High Impact Weather (Special), China Meteorological Administration, Changsha, ChinaCollege of Meteorology and Oceanography, National University of Defense Technology, Changsha, ChinaKey Laboratory of High Impact Weather (Special), China Meteorological Administration, Changsha, ChinaCollege of Meteorology and Oceanography, National University of Defense Technology, Changsha, ChinaKey Laboratory of High Impact Weather (Special), China Meteorological Administration, Changsha, ChinaIn recent years, the random forest model has been widely applied to analyze the relationships among air pollution, meteorological factors, and human health. To investigate the patterns and influencing factors of respiratory disease-related medical visits, this study utilized data on medical visits from urban areas of Tianjin, meteorological observations, and pollution data. First, the temporal variation characteristics of medical visits from 2013 to 2019 were analyzed. Subsequently, the random forest model was employed to identify the dominant influencing factors of respiratory disease-related medical visits and to construct a statistical forecasting model that relates these factors to the number of visits. Additionally, a predictive analysis of medical visits in Tianjin for the year 2019 was conducted. The results indicate the following: (1) From 2013 to 2019, the number of medical visits exhibited seasonal fluctuations, with a significant decline observed in 2017, which may be directly related to adjustments in hospital policies. (2) Among the meteorological factors, average temperature, relative humidity, precipitation, and ozone concentration significantly influenced the variation in medical visits, while wind speed, precipitation amount, and boundary layer height were of lesser importance. Furthermore, different linear relationships exist among the meteorological factors; specifically, meteorological factors show a negative correlation with pollutant elements, and there is a strong correlation among the pollutant factors. (3) When the number of medical visits ranged from 50 to 200, the predictions made by the random forest model closely matched the actual values, demonstrating strong predictive performance and the ability to effectively forecast daily variations in medical visits over extended periods, thus exhibiting good stability and generalization capability. (4) However, since the random forest model relies on a large amount of data for model validation, it has limitations in capturing extreme variations in medical visit numbers. Future research could address this issue by integrating different models to enhance predictive capabilities.https://www.frontiersin.org/articles/10.3389/fpubh.2025.1537238/fullrandom forestpredictionmeteorological factorshuman healthpollutant
spellingShingle Xiaotong Yang
Xiaotong Yang
Yi Li
Yi Li
Lang Liu
Lang Liu
Zengliang Zang
Zengliang Zang
Prediction of respiratory diseases based on random forest model
Frontiers in Public Health
random forest
prediction
meteorological factors
human health
pollutant
title Prediction of respiratory diseases based on random forest model
title_full Prediction of respiratory diseases based on random forest model
title_fullStr Prediction of respiratory diseases based on random forest model
title_full_unstemmed Prediction of respiratory diseases based on random forest model
title_short Prediction of respiratory diseases based on random forest model
title_sort prediction of respiratory diseases based on random forest model
topic random forest
prediction
meteorological factors
human health
pollutant
url https://www.frontiersin.org/articles/10.3389/fpubh.2025.1537238/full
work_keys_str_mv AT xiaotongyang predictionofrespiratorydiseasesbasedonrandomforestmodel
AT xiaotongyang predictionofrespiratorydiseasesbasedonrandomforestmodel
AT yili predictionofrespiratorydiseasesbasedonrandomforestmodel
AT yili predictionofrespiratorydiseasesbasedonrandomforestmodel
AT langliu predictionofrespiratorydiseasesbasedonrandomforestmodel
AT langliu predictionofrespiratorydiseasesbasedonrandomforestmodel
AT zengliangzang predictionofrespiratorydiseasesbasedonrandomforestmodel
AT zengliangzang predictionofrespiratorydiseasesbasedonrandomforestmodel