Machine learning for prediction of childhood mental health problems in social care

Background Rates of childhood mental health problems are increasing in the UK. Early identification of childhood mental health problems is challenging but critical to children’s future psychosocial development. This is particularly important for children with social care contact because earlier id...

Full description

Saved in:
Bibliographic Details
Main Authors: Ryan Crowley, Katherine Parkin, Emma Rocheteau, Efthalia Massou, Yasmin Friedmann, Ann John, Rachel Sippy, Pietro Liò, Anna Moore
Format: Article
Language:English
Published: Cambridge University Press 2025-05-01
Series:BJPsych Open
Subjects:
Online Access:https://www.cambridge.org/core/product/identifier/S2056472425000328/type/journal_article
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Background Rates of childhood mental health problems are increasing in the UK. Early identification of childhood mental health problems is challenging but critical to children’s future psychosocial development. This is particularly important for children with social care contact because earlier identification can facilitate earlier intervention. Clinical prediction tools could improve these early intervention efforts. Aims Characterise a novel cohort consisting of children in social care and develop effective machine learning models for prediction of childhood mental health problems. Method We used linked, de-identified data from the Secure Anonymised Information Linkage Databank to create a cohort of 26 820 children in Wales, UK, receiving social care services. Integrating health, social care and education data, we developed several machine learning models aimed at predicting childhood mental health problems. We assessed the performance, interpretability and fairness of these models. Results Risk factors strongly associated with childhood mental health problems included age, substance misuse and being a looked after child. The best-performing model, a gradient boosting classifier, achieved an area under the receiver operating characteristic curve of 0.75 (95% CI 0.73–0.78). Assessments of algorithmic fairness showed potential biases within these models. Conclusions Machine learning performance on this prediction task was promising. Predictive performance in social care settings can be bolstered by linking diverse routinely collected data-sets, making available a range of heterogenous risk factors relating to clinical, social and environmental exposures.
ISSN:2056-4724