Impact of inconsistent ethnicity recordings on estimates of inequality in child health and education data: a data linkage study of Child and Adolescent Mental Health Services in South London
Objectives Ethnicity data are critical for identifying inequalities, but previous studies suggest that ethnicity is not consistently recorded between different administrative datasets. With researchers increasingly leveraging cross-domain data linkages, we investigated the completeness and consisten...
| Main Authors: | Tamsin Ford, Johnny Downs, Robert Stewart, Amelia Jewell, Jayati Das-Munshi, Alice Wickersham |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | BMJ Publishing Group, 2024-03-01 |
| Series: | BMJ Open |
| Online Access: | https://bmjopen.bmj.com/content/14/3/e078788.full |
| _version_ | 1850195046697009152 |
|---|---|
| author | Tamsin Ford Johnny Downs Robert Stewart Amelia Jewell Jayati Das-Munshi Alice Wickersham |
| author_sort | Tamsin Ford |
| collection | DOAJ |
| description | Objectives Ethnicity data are critical for identifying inequalities, but previous studies suggest that ethnicity is not consistently recorded between different administrative datasets. With researchers increasingly leveraging cross-domain data linkages, we investigated the completeness and consistency of ethnicity data in two linked health and education datasets. Design Cohort study. Setting South London and Maudsley NHS Foundation Trust deidentified electronic health records, accessed via Clinical Record Interactive Search (CRIS) and the National Pupil Database (NPD) (2007–2013). Participants N=30 426 children and adolescents referred to local Child and Adolescent Mental Health Services. Primary and secondary outcome measures Ethnicity data were compared between CRIS and the NPD. Associations between ethnicity as recorded from each source and key educational and clinical outcomes were explored with risk ratios. Results Ethnicity data were available for 79.3% from the NPD, 87.0% from CRIS, 97.3% from either source and 69.0% from both sources. Among those who had ethnicity data from both, the two data sources agreed on 87.0% of aggregate ethnicity categorisations overall, but with high levels of disagreement in Mixed and Other ethnic groups. Strengths of associations between ethnicity, educational attainment and neurodevelopmental disorder varied according to which data source was used to code ethnicity. For example, as compared with White pupils, a significantly higher proportion of Asian pupils achieved expected educational attainment thresholds only if ethnicity was coded from the NPD (RR=1.46, 95% CI 1.29 to 1.64), not if ethnicity was coded from CRIS (RR=1.11, 0.98 to 1.26). Conclusions Data linkage has the potential to minimise missing ethnicity data, and overlap in ethnicity categorisations between CRIS and the NPD was generally high. However, choosing which data source to primarily code ethnicity from can have implications for analyses of ethnicity, mental health and educational outcomes. Users of linked data should exercise caution in combining and comparing ethnicity between different data sources. |
| format | Article |
| id | doaj-art-959ca03a77e04ca09f1c7b42b1ee1397 |
| institution | OA Journals |
| issn | 2044-6055 |
| language | English |
| publishDate | 2024-03-01 |
| publisher | BMJ Publishing Group |
| record_format | Article |
| series | BMJ Open |
| spelling | doaj-art-959ca03a77e04ca09f1c7b42b1ee1397; 2025-08-20T02:13:52Z; eng; BMJ Publishing Group; BMJ Open; ISSN 2044-6055; 2024-03-01; volume 14, issue 3; DOI 10.1136/bmjopen-2023-078788; Impact of inconsistent ethnicity recordings on estimates of inequality in child health and education data: a data linkage study of Child and Adolescent Mental Health Services in South London; Authors: Tamsin Ford [0], Johnny Downs [1], Robert Stewart [2], Amelia Jewell [3], Jayati Das-Munshi [4], Alice Wickersham [5]; Affiliations: [0] Department of Psychiatry, University of Cambridge, Cambridge, UK; [1] CAMHS Digital Lab, Department of Child and Adolescent Psychiatry, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK; [2] South London and Maudsley NHS Foundation Trust, London, UK; [3] South London & Maudsley NHS Foundation Trust, London, UK; [4] South London and Maudsley NHS Foundation Trust, London, UK; [5] CAMHS Digital Lab, Department of Child and Adolescent Psychiatry, Institute of Psychiatry, Psychology & Neuroscience, King’s College London, London, UK |
| title | Impact of inconsistent ethnicity recordings on estimates of inequality in child health and education data: a data linkage study of Child and Adolescent Mental Health Services in South London |
| url | https://bmjopen.bmj.com/content/14/3/e078788.full |
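
The description above reports two kinds of quantities: the percentage of records on which the two sources agree, and risk ratios with 95% confidence intervals. The following is a minimal sketch of how such figures are commonly computed. It is not the authors' code: the record does not say which confidence interval method was used, so a Wald interval on the log scale is assumed here, and the function names `risk_ratio` and `percent_agreement` as well as all counts and category lists in the usage lines are hypothetical illustrations, not study data.

```python
import math


def risk_ratio(events_exposed, n_exposed, events_ref, n_ref, z=1.96):
    """Risk ratio with a Wald-type interval on the log scale (assumed method)."""
    risk_exposed = events_exposed / n_exposed
    risk_ref = events_ref / n_ref
    rr = risk_exposed / risk_ref
    # Standard error of log(RR) for two independent binomial proportions.
    se_log_rr = math.sqrt(
        1 / events_exposed - 1 / n_exposed + 1 / events_ref - 1 / n_ref
    )
    lower = math.exp(math.log(rr) - z * se_log_rr)
    upper = math.exp(math.log(rr) + z * se_log_rr)
    return rr, lower, upper


def percent_agreement(source_a, source_b):
    """Percentage of matching categories among records coded in both sources."""
    # Records with a missing value (None) in either source are excluded.
    pairs = [
        (a, b) for a, b in zip(source_a, source_b)
        if a is not None and b is not None
    ]
    if not pairs:
        raise ValueError("no records have a category in both sources")
    return 100 * sum(a == b for a, b in pairs) / len(pairs)


# Hypothetical counts, for illustration only (not taken from the study).
rr, lower, upper = risk_ratio(30, 100, 20, 100)
print(f"RR={rr:.2f}, 95% CI {lower:.2f} to {upper:.2f}")

# Hypothetical category lists standing in for two linked sources.
agreement = percent_agreement(
    ["White", "Asian", None, "Mixed"],
    ["White", "Asian", "Black", "Other"],
)
print(f"agreement: {agreement:.1f}%")
```

An interval that includes 1 indicates that the difference from the reference group is not statistically significant at the chosen level, which is how the description distinguishes the NPD-coded and CRIS-coded results.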