The Risk of Linked Census Data to Transgender Youth

Every ten years, the United States Census Bureau collects data on all people living in the US, including information on age, sex, race, ethnicity, and household relationship. We conducted a simulation study to investigate the risk of disclosing a change in how an individual's sex was recorded...

Full description

Saved in:
Bibliographic Details
Main Authors: Abraham Flaxman, Os Keyes
Format: Article
Language:English
Published: Labor Dynamics Institute 2025-03-01
Series:The Journal of Privacy and Confidentiality
Subjects:
Online Access:https://journalprivacyconfidentiality.org/index.php/jpc/article/view/891
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849701345507934208
author Abraham Flaxman
Os Keyes
author_facet Abraham Flaxman
Os Keyes
author_sort Abraham Flaxman
collection DOAJ
description Every ten years, the United States Census Bureau collects data on all people living in the US, including information on age, sex, race, ethnicity, and household relationship. We conducted a simulation study to investigate the risk of disclosing a change in how an individual's sex was recorded in successive censuses. In a simulated population based on a reconstruction of the 2010 decennial census of Texas, we compared the number of transgender individuals under 18 identified by linking simulated census data from 2010 and 2020 under alternative approaches to disclosure avoidance, including swapping in 2020 (as used in 2010) and TopDown in 2020 (as used for the actual data released from the 2020 enumeration). Our simulation assumed that in Texas, 0.2\% of the 3,095,857 residents who were under the age of 8 in the 2010 census were transgender and would have a different sex reported in the 2020 census, and 23\% would reside at the same address, which implied that 1,424 trans youth were at risk of having their transgender identity status disclosed by a reconstruction-abetted linkage attack. We found that without any disclosure avoidance in 2010 or 2020, a reconstruction-abetted linkage attack identified 657 transgender youth. With 5\% swapping in 2010 and 2020, it identified 605 individuals, an 8\% decrease. With swapping in 2010 and TopDown in 2020 as configured for the actual data release, it identified 194 individuals, a 68\% decrease from swapping. Our simulation found that the TopDown configuration attains the maximum achievable level of privacy protection against such an attack. Our results demonstrate the importance of disclosure avoidance for census data and suggest that the TopDown approach used by the Census Bureau is a substantial improvement compared to the previous approach, achieving the maximum level of privacy protection possible against such a linkage attack.
format Article
id doaj-art-c16a0d89ac1a4f27a2fd57cc4c5646d3
institution DOAJ
issn 2575-8527
language English
publishDate 2025-03-01
publisher Labor Dynamics Institute
record_format Article
series The Journal of Privacy and Confidentiality
spelling doaj-art-c16a0d89ac1a4f27a2fd57cc4c5646d32025-08-20T03:17:58ZengLabor Dynamics InstituteThe Journal of Privacy and Confidentiality2575-85272025-03-0115110.29012/jpc.891The Risk of Linked Census Data to Transgender YouthAbraham Flaxman0Os Keyes1University of WashingtonUniversity of Washington Every ten years, the United States Census Bureau collects data on all people living in the US, including information on age, sex, race, ethnicity, and household relationship. We conducted a simulation study to investigate the risk of disclosing a change in how an individual's sex was recorded in successive censuses. In a simulated population based on a reconstruction of the 2010 decennial census of Texas, we compared the number of transgender individuals under 18 identified by linking simulated census data from 2010 and 2020 under alternative approaches to disclosure avoidance, including swapping in 2020 (as used in 2010) and TopDown in 2020 (as used for the actual data released from the 2020 enumeration). Our simulation assumed that in Texas, 0.2\% of the 3,095,857 residents who were under the age of 8 in the 2010 census were transgender and would have a different sex reported in the 2020 census, and 23\% would reside at the same address, which implied that 1,424 trans youth were at risk of having their transgender identity status disclosed by a reconstruction-abetted linkage attack. We found that without any disclosure avoidance in 2010 or 2020, a reconstruction-abetted linkage attack identified 657 transgender youth. With 5\% swapping in 2010 and 2020, it identified 605 individuals, an 8\% decrease. With swapping in 2010 and TopDown in 2020 as configured for the actual data release, it identified 194 individuals, a 68\% decrease from swapping. Our simulation found that the TopDown configuration attains the maximum achievable level of privacy protection against such an attack. Our results demonstrate the importance of disclosure avoidance for census data and suggest that the TopDown approach used by the Census Bureau is a substantial improvement compared to the previous approach, achieving the maximum level of privacy protection possible against such a linkage attack. https://journalprivacyconfidentiality.org/index.php/jpc/article/view/891censusdisclosure avoidancelinkage attack
spellingShingle Abraham Flaxman
Os Keyes
The Risk of Linked Census Data to Transgender Youth
The Journal of Privacy and Confidentiality
census
disclosure avoidance
linkage attack
title The Risk of Linked Census Data to Transgender Youth
title_full The Risk of Linked Census Data to Transgender Youth
title_fullStr The Risk of Linked Census Data to Transgender Youth
title_full_unstemmed The Risk of Linked Census Data to Transgender Youth
title_short The Risk of Linked Census Data to Transgender Youth
title_sort risk of linked census data to transgender youth
topic census
disclosure avoidance
linkage attack
url https://journalprivacyconfidentiality.org/index.php/jpc/article/view/891
work_keys_str_mv AT abrahamflaxman theriskoflinkedcensusdatatotransgenderyouth
AT oskeyes theriskoflinkedcensusdatatotransgenderyouth
AT abrahamflaxman riskoflinkedcensusdatatotransgenderyouth
AT oskeyes riskoflinkedcensusdatatotransgenderyouth