A data pipeline for secure extraction and sharing of social determinants of health.

<h4>Objectives</h4>Linking neighborhood- and patient-level data provides valuable information about the influence of upstream social determinants of health (SDOH). However, sharing of these data across health systems presents challenges. We set out to develop a pipeline to acquire, deide...

Full description

Saved in:
Bibliographic Details
Main Authors: Tyler Schappe, Lisa M McElroy, Moronke Ogundolie, Roland Matsouaka, Ursula Rogers, Nrupen A Bhavsar
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0317215
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:<h4>Objectives</h4>Linking neighborhood- and patient-level data provides valuable information about the influence of upstream social determinants of health (SDOH). However, sharing of these data across health systems presents challenges. We set out to develop a pipeline to acquire, deidentify, and share neighborhood-level SDOH data across multiple health systems.<h4>Methods</h4>We created a pipeline centered around Decentralized Geomarker Assessment for Multi-Site Studies (DeGAUSS) that utilizes containerization to geocode patient addresses and obtain neighborhood-level SDOH variables. We compared DeGAUSS to a third-party vendor geocoding tool available at Duke Health using a cohort of adult patients referred for abdominal transplant from January 1, 2016, to December 31, 2022. We calculated Cohen's Kappa and percent disagreement at census block group and tract levels, and by Area Deprivation Index, urbanicity, and year.<h4>Results</h4>The pipeline successfully generated SDOH data for 97.8% of addresses. There was high concordance between DeGAUSS and the vendor tool at the census block group (0.93) and tract levels (0.95). At the block group level, disagreement proportion differed by year and urbanicity, with larger disagreement in the rural category than in micropolitan and metropolitan categories (13%, 7%, 6.2%, respectively).<h4>Discussion and conclusion</h4>We describe a novel pipeline that can facilitate the secure acquisition and sharing of neighborhood-level SDOH without sharing PHI. The pipeline can be scaled to include additional social, climate, and environmental variables, and can be extended to an unlimited number of health systems.
ISSN:1932-6203