Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization Study
BackgroundSocial determinants of health (SDoH), such as financial resources and housing stability, account for between 30% and 55% of people’s health outcomes. While many studies have identified strong associations between specific SDoH and health outcomes, little is known ab...
Saved in:
Main Authors: | , , , , , , , , , , , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
JMIR Publications
2025-02-01
|
Series: | Journal of Medical Internet Research |
Online Access: | https://www.jmir.org/2025/1/e48775 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1823858074262175744 |
---|---|
author | Suresh K Bhavnani Weibin Zhang Daniel Bao Mukaila Raji Veronica Ajewole Rodney Hunter Yong-Fang Kuo Susanne Schmidt Monique R Pappadis Elise Smith Alex Bokov Timothy Reistetter Shyam Visweswaran Brian Downer |
author_facet | Suresh K Bhavnani Weibin Zhang Daniel Bao Mukaila Raji Veronica Ajewole Rodney Hunter Yong-Fang Kuo Susanne Schmidt Monique R Pappadis Elise Smith Alex Bokov Timothy Reistetter Shyam Visweswaran Brian Downer |
author_sort | Suresh K Bhavnani |
collection | DOAJ |
description |
BackgroundSocial determinants of health (SDoH), such as financial resources and housing stability, account for between 30% and 55% of people’s health outcomes. While many studies have identified strong associations between specific SDoH and health outcomes, little is known about how SDoH co-occur to form subtypes critical for designing targeted interventions. Such analysis has only now become possible through the All of Us program.
ObjectiveThis study aims to analyze the All of Us dataset for addressing two research questions: (1) What are the range of and responses to survey questions related to SDoH? and (2) How do SDoH co-occur to form subtypes, and what are their risks for adverse health outcomes?
MethodsFor question 1, an expert panel analyzed the range of and responses to SDoH questions across 6 surveys in the full All of Us dataset (N=372,397; version 6). For question 2, due to systematic missingness and uneven granularity of questions across the surveys, we selected all participants with valid and complete SDoH data and used inverse probability weighting to adjust their imbalance in demographics. Next, an expert panel grouped the SDoH questions into SDoH factors to enable more consistent granularity. To identify the subtypes, we used bipartite modularity maximization for identifying SDoH biclusters and measured their significance and replicability. Next, we measured their association with 3 outcomes (depression, delayed medical care, and emergency room visits in the last year). Finally, the expert panel inferred the subtype labels, potential mechanisms, and targeted interventions.
ResultsThe question 1 analysis identified 110 SDoH questions across 4 surveys covering all 5 domains in Healthy People 2030. As the SDoH questions varied in granularity, they were categorized by an expert panel into 18 SDoH factors. The question 2 analysis (n=12,913; d=18) identified 4 biclusters with significant biclusteredness (Q=0.13; random-Q=0.11; z=7.5; P<.001) and significant replication (real Rand index=0.88; random Rand index=0.62; P<.001). Each subtype had significant associations with specific outcomes and had meaningful interpretations and potential targeted interventions. For example, the Socioeconomic barriers subtype included 6 SDoH factors (eg, not employed and food insecurity) and had a significantly higher odds ratio (4.2, 95% CI 3.5-5.1; P<.001) for depression when compared to other subtypes. The expert panel inferred implications of the results for designing interventions and health care policies based on SDoH subtypes.
ConclusionsThis study identified SDoH subtypes that had statistically significant biclusteredness and replicability, each of which had significant associations with specific adverse health outcomes and with translational implications for targeted SDoH interventions and health care policies. However, the high degree of systematic missingness requires repeating the analysis as the data become more complete by using our generalizable and scalable machine learning code available on the All of Us workbench. |
format | Article |
id | doaj-art-b8c66774a59d4a9782c619de9e7cdd7b |
institution | Kabale University |
issn | 1438-8871 |
language | English |
publishDate | 2025-02-01 |
publisher | JMIR Publications |
record_format | Article |
series | Journal of Medical Internet Research |
spelling | doaj-art-b8c66774a59d4a9782c619de9e7cdd7b2025-02-11T16:00:35ZengJMIR PublicationsJournal of Medical Internet Research1438-88712025-02-0127e4877510.2196/48775Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization StudySuresh K Bhavnanihttps://orcid.org/0000-0002-9282-8838Weibin Zhanghttps://orcid.org/0000-0001-6161-0978Daniel Baohttps://orcid.org/0000-0002-3913-0609Mukaila Rajihttps://orcid.org/0000-0002-7460-7281Veronica Ajewolehttps://orcid.org/0000-0002-6619-2979Rodney Hunterhttps://orcid.org/0009-0009-3357-6827Yong-Fang Kuohttps://orcid.org/0000-0003-1927-0927Susanne Schmidthttps://orcid.org/0000-0001-7684-1648Monique R Pappadishttps://orcid.org/0000-0003-4742-4380Elise Smithhttps://orcid.org/0000-0002-4615-8204Alex Bokovhttps://orcid.org/0000-0002-0511-9815Timothy Reistetterhttps://orcid.org/0000-0003-1732-5533Shyam Visweswaranhttps://orcid.org/0000-0002-2079-8684Brian Downerhttps://orcid.org/0000-0001-5792-0527 BackgroundSocial determinants of health (SDoH), such as financial resources and housing stability, account for between 30% and 55% of people’s health outcomes. While many studies have identified strong associations between specific SDoH and health outcomes, little is known about how SDoH co-occur to form subtypes critical for designing targeted interventions. Such analysis has only now become possible through the All of Us program. ObjectiveThis study aims to analyze the All of Us dataset for addressing two research questions: (1) What are the range of and responses to survey questions related to SDoH? and (2) How do SDoH co-occur to form subtypes, and what are their risks for adverse health outcomes? MethodsFor question 1, an expert panel analyzed the range of and responses to SDoH questions across 6 surveys in the full All of Us dataset (N=372,397; version 6). For question 2, due to systematic missingness and uneven granularity of questions across the surveys, we selected all participants with valid and complete SDoH data and used inverse probability weighting to adjust their imbalance in demographics. Next, an expert panel grouped the SDoH questions into SDoH factors to enable more consistent granularity. To identify the subtypes, we used bipartite modularity maximization for identifying SDoH biclusters and measured their significance and replicability. Next, we measured their association with 3 outcomes (depression, delayed medical care, and emergency room visits in the last year). Finally, the expert panel inferred the subtype labels, potential mechanisms, and targeted interventions. ResultsThe question 1 analysis identified 110 SDoH questions across 4 surveys covering all 5 domains in Healthy People 2030. As the SDoH questions varied in granularity, they were categorized by an expert panel into 18 SDoH factors. The question 2 analysis (n=12,913; d=18) identified 4 biclusters with significant biclusteredness (Q=0.13; random-Q=0.11; z=7.5; P<.001) and significant replication (real Rand index=0.88; random Rand index=0.62; P<.001). Each subtype had significant associations with specific outcomes and had meaningful interpretations and potential targeted interventions. For example, the Socioeconomic barriers subtype included 6 SDoH factors (eg, not employed and food insecurity) and had a significantly higher odds ratio (4.2, 95% CI 3.5-5.1; P<.001) for depression when compared to other subtypes. The expert panel inferred implications of the results for designing interventions and health care policies based on SDoH subtypes. ConclusionsThis study identified SDoH subtypes that had statistically significant biclusteredness and replicability, each of which had significant associations with specific adverse health outcomes and with translational implications for targeted SDoH interventions and health care policies. However, the high degree of systematic missingness requires repeating the analysis as the data become more complete by using our generalizable and scalable machine learning code available on the All of Us workbench.https://www.jmir.org/2025/1/e48775 |
spellingShingle | Suresh K Bhavnani Weibin Zhang Daniel Bao Mukaila Raji Veronica Ajewole Rodney Hunter Yong-Fang Kuo Susanne Schmidt Monique R Pappadis Elise Smith Alex Bokov Timothy Reistetter Shyam Visweswaran Brian Downer Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization Study Journal of Medical Internet Research |
title | Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization Study |
title_full | Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization Study |
title_fullStr | Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization Study |
title_full_unstemmed | Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization Study |
title_short | Subtyping Social Determinants of Health in the "All of Us" Program: Network Analysis and Visualization Study |
title_sort | subtyping social determinants of health in the all of us program network analysis and visualization study |
url | https://www.jmir.org/2025/1/e48775 |
work_keys_str_mv | AT sureshkbhavnani subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT weibinzhang subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT danielbao subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT mukailaraji subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT veronicaajewole subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT rodneyhunter subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT yongfangkuo subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT susanneschmidt subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT moniquerpappadis subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT elisesmith subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT alexbokov subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT timothyreistetter subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT shyamvisweswaran subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy AT briandowner subtypingsocialdeterminantsofhealthintheallofusprogramnetworkanalysisandvisualizationstudy |