An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South Korea

Understanding environmental disease risk factor analysis at the district level is essential for gaining valuable insights into regional disease variations, offering a broader perspective compared to individual-level studies. Recently, explainable artificial intelligence (XAI) has received increasing...

Full description

Saved in:
Bibliographic Details
Main Authors: Eunjin Kang, Dongjin Cho, Siwoo Lee, Jungho Im, Dongwook Lee, Cheolhee Yoo
Format: Article
Language:English
Published: Taylor & Francis Group 2024-12-01
Series:GIScience & Remote Sensing
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/15481603.2024.2436997
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850135430990659584
author Eunjin Kang
Dongjin Cho
Siwoo Lee
Jungho Im
Dongwook Lee
Cheolhee Yoo
author_facet Eunjin Kang
Dongjin Cho
Siwoo Lee
Jungho Im
Dongwook Lee
Cheolhee Yoo
author_sort Eunjin Kang
collection DOAJ
description Understanding environmental disease risk factor analysis at the district level is essential for gaining valuable insights into regional disease variations, offering a broader perspective compared to individual-level studies. Recently, explainable artificial intelligence (XAI) has received increasing attention in the analysis of factors affecting public health. However, previous purely data-driven XAI-based risk factor analyses faced challenges in capturing regional effect of environmental variables, leading to confusion regarding key spatiotemporal risk factors. Therefore, this study proposes a framework that includes two complementary XAI-based risk factor analyses following two assumptions. Regionally rescaled environmental variables must account for the unequal effects on environmental factors, which are likely influenced by variations in adaptation capacity to weather conditions and differences in exposure-response relationships to air pollutants. District-level disease distribution highlights geographic disparity in sociodemographic vulnerability, whereas temporal variation in diseases by district underscores temporal environmental impacts. Based on these two hypotheses, we rescaled environmental variables using two complementary schemes: one that employs the district-level disease distribution as the target variable, and another that utilizes the temporal residual of the disease within each district. We evaluated this framework by analyzing the association between cardiovascular age-standardized mortality rate (CVD-ASMR) and various risk factors in South Korea from 2010 to 2019, using high-performing random forest and light gradient boosting models with additive Shapley explanation. Compared to previous purely data-driven XAI-based analyses, the proposed schemes achieved significantly better results in capturing regional exposure-response relationships. In two complementary schemes, the most explainable factor to districts with high CVD-ASMR was low education level related to sociodemographic vulnerability, whereas the most explainable factors to high temporal CVD-ASMR patterns were low greenness and high air pollution levels. In addition, the two complementary schemes enabled us to reasonably analyze the interaction effect of the two risk factors, i.e. temperature and air pollutants. Furthermore, high CVD-ASMR and its high temporal variation were observed in situations of high sociodemographic vulnerability with poor air quality. These findings provide insightful public health planning for sustainable cities and society by pinpointing high-risk areas and tailoring strategies to address regional environmental challenges.
format Article
id doaj-art-0b2bd7c0a0764491a7ec3fec19cc0eb7
institution OA Journals
issn 1548-1603
1943-7226
language English
publishDate 2024-12-01
publisher Taylor & Francis Group
record_format Article
series GIScience & Remote Sensing
spelling doaj-art-0b2bd7c0a0764491a7ec3fec19cc0eb72025-08-20T02:31:26ZengTaylor & Francis GroupGIScience & Remote Sensing1548-16031943-72262024-12-0161110.1080/15481603.2024.2436997An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South KoreaEunjin Kang0Dongjin Cho1Siwoo Lee2Jungho Im3Dongwook Lee4Cheolhee Yoo5Department of Civil, Urban, Earth, and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South KoreaDepartment of Civil, Urban, Earth, and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South KoreaDepartment of Civil, Urban, Earth, and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South KoreaDepartment of Civil, Urban, Earth, and Environmental Engineering, Ulsan National Institute of Science and Technology (UNIST), Ulsan, South KoreaDepartment of Occupational and Environmental Medicine, Inha University Hospital, Inha University, Incheon, South KoreaDepartment of Land Surveying and Geo-Informatics, The Hong Kong Polytechnic University, Hung Hom, Hong KongUnderstanding environmental disease risk factor analysis at the district level is essential for gaining valuable insights into regional disease variations, offering a broader perspective compared to individual-level studies. Recently, explainable artificial intelligence (XAI) has received increasing attention in the analysis of factors affecting public health. However, previous purely data-driven XAI-based risk factor analyses faced challenges in capturing regional effect of environmental variables, leading to confusion regarding key spatiotemporal risk factors. Therefore, this study proposes a framework that includes two complementary XAI-based risk factor analyses following two assumptions. Regionally rescaled environmental variables must account for the unequal effects on environmental factors, which are likely influenced by variations in adaptation capacity to weather conditions and differences in exposure-response relationships to air pollutants. District-level disease distribution highlights geographic disparity in sociodemographic vulnerability, whereas temporal variation in diseases by district underscores temporal environmental impacts. Based on these two hypotheses, we rescaled environmental variables using two complementary schemes: one that employs the district-level disease distribution as the target variable, and another that utilizes the temporal residual of the disease within each district. We evaluated this framework by analyzing the association between cardiovascular age-standardized mortality rate (CVD-ASMR) and various risk factors in South Korea from 2010 to 2019, using high-performing random forest and light gradient boosting models with additive Shapley explanation. Compared to previous purely data-driven XAI-based analyses, the proposed schemes achieved significantly better results in capturing regional exposure-response relationships. In two complementary schemes, the most explainable factor to districts with high CVD-ASMR was low education level related to sociodemographic vulnerability, whereas the most explainable factors to high temporal CVD-ASMR patterns were low greenness and high air pollution levels. In addition, the two complementary schemes enabled us to reasonably analyze the interaction effect of the two risk factors, i.e. temperature and air pollutants. Furthermore, high CVD-ASMR and its high temporal variation were observed in situations of high sociodemographic vulnerability with poor air quality. These findings provide insightful public health planning for sustainable cities and society by pinpointing high-risk areas and tailoring strategies to address regional environmental challenges.https://www.tandfonline.com/doi/10.1080/15481603.2024.2436997Explainable artificial intelligencespatiotemporal risk factor analysiscommunity healthlocal exposure-responsevulnerability
spellingShingle Eunjin Kang
Dongjin Cho
Siwoo Lee
Jungho Im
Dongwook Lee
Cheolhee Yoo
An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South Korea
GIScience & Remote Sensing
Explainable artificial intelligence
spatiotemporal risk factor analysis
community health
local exposure-response
vulnerability
title An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South Korea
title_full An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South Korea
title_fullStr An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South Korea
title_full_unstemmed An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South Korea
title_short An explainable AI framework for spatiotemporal risk factor analysis in public health: a case study of cardiovascular mortality in South Korea
title_sort explainable ai framework for spatiotemporal risk factor analysis in public health a case study of cardiovascular mortality in south korea
topic Explainable artificial intelligence
spatiotemporal risk factor analysis
community health
local exposure-response
vulnerability
url https://www.tandfonline.com/doi/10.1080/15481603.2024.2436997
work_keys_str_mv AT eunjinkang anexplainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT dongjincho anexplainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT siwoolee anexplainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT junghoim anexplainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT dongwooklee anexplainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT cheolheeyoo anexplainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT eunjinkang explainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT dongjincho explainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT siwoolee explainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT junghoim explainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT dongwooklee explainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea
AT cheolheeyoo explainableaiframeworkforspatiotemporalriskfactoranalysisinpublichealthacasestudyofcardiovascularmortalityinsouthkorea