Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)

BackgroundInternet Addiction (IA) has emerged as a critical concern, especially among school age children and adolescents, potentially stalling their physical and mental development. Our study aimed to examine the risk factors associated with IA among Chinese children and adolescents and leverage ex...

Full description

Saved in:
Bibliographic Details
Main Authors: Jiahe Liu, Lang Chen, Yuxin Chen, Jingsong Luo, Kexin Yu, Linlin Fan, Chan Yong, Huiyu He, Simei Liao, Zongyuan Ge, Lihua Jiang
Format: Article
Language:English
Published: Frontiers Media S.A. 2025-07-01
Series:Frontiers in Public Health
Subjects:
Online Access:https://www.frontiersin.org/articles/10.3389/fpubh.2025.1590689/full
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849318460998287360
author Jiahe Liu
Lang Chen
Yuxin Chen
Jingsong Luo
Kexin Yu
Linlin Fan
Chan Yong
Huiyu He
Simei Liao
Zongyuan Ge
Lihua Jiang
Lihua Jiang
author_facet Jiahe Liu
Lang Chen
Yuxin Chen
Jingsong Luo
Kexin Yu
Linlin Fan
Chan Yong
Huiyu He
Simei Liao
Zongyuan Ge
Lihua Jiang
Lihua Jiang
author_sort Jiahe Liu
collection DOAJ
description BackgroundInternet Addiction (IA) has emerged as a critical concern, especially among school age children and adolescents, potentially stalling their physical and mental development. Our study aimed to examine the risk factors associated with IA among Chinese children and adolescents and leverage explainable machine learning (ML) algorithms to predict IA status at the time of assessment, based on Young’s Internet Addiction Test.MethodsThe longitudinal data consisting of 8,824 schoolchildren from the Chengdu Positive Child Development (CPCD) survey were analyzed, where 33.3% of participants were identified with IA (Age: 10.97 ± 2.31, Male: 51.73%). IA was defined using Young’s Internet Addiction Test (IAT ≥ 40). Demographic variables such as age, gender, and grade level, along with key variables including scores of Cognitive Behavioral Competencies (CBC), Prosocial Attributes (PA), Positive Identity (PI), General Positive Youth Development Qualities (GPYDQ), Life Satisfaction (LS), Delinquent Behavior (DB), Non-Suicidal Self-Injury (NSSI), Depression (DP), Anxiety (AX), Family Function Disorders (FF), Egocentrism (EG), Empathy (EP), Academic Intrinsic Value (IV), and Academic Utility Value (UV) were examined. Chi-square and Mann–Whitney U tests were employed to validate the significance of the mentioned predictors of IA. We applied six ML models: Extra Random Forest, XGBoost, Logistic Regression, Bernoulli Naïve Bayes, Multi-Layer Perceptron (MLP), and Transformer Encoder. Performance was evaluated via 10-fold cross-validation and held-out test sets across survey waves. Feature selection and SHapley Additive exPlanations (SHAP) analysis were utilised for model improvement and interpretability, respectively.ResultsExtraRFC achieved the best performance (Test AUC = 0.854, Accuracy = 0.798, F1 = 0.659), outperforming all other models across most metrics and external validations. Key predictors included grade level, delinquent behavior, anxiety, family function, and depression scores. SHAP analysis revealed consistent and interpretable feature contributions across individuals.ConclusionDepression, anxiety, and family dynamics are significant factors influencing IA in children. The Extra Random Forest model proves most effective in predicting IA, emphasising the importance of addressing these factors to promote healthy digital habits in children. This study presents an effective SHAP-based explainable ML framework for IA prediction in children and adolescents.
format Article
id doaj-art-15833cd17135471394d5ddf6047c8cbb
institution Kabale University
issn 2296-2565
language English
publishDate 2025-07-01
publisher Frontiers Media S.A.
record_format Article
series Frontiers in Public Health
spelling doaj-art-15833cd17135471394d5ddf6047c8cbb2025-08-20T03:50:49ZengFrontiers Media S.A.Frontiers in Public Health2296-25652025-07-011310.3389/fpubh.2025.15906891590689Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)Jiahe Liu0Lang Chen1Yuxin Chen2Jingsong Luo3Kexin Yu4Linlin Fan5Chan Yong6Huiyu He7Simei Liao8Zongyuan Ge9Lihua Jiang10Lihua Jiang11AIM for Health Lab, Monash University, Melbourne, VIC, AustraliaSchool of Mathematics and Statistics, University of Melbourne, Melbourne, VIC, AustraliaJockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong, Hong Kong SAR, ChinaJockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong, Hong Kong SAR, ChinaJockey Club School of Public Health and Primary Care, The Chinese University of Hong Kong, Hong Kong, Hong Kong SAR, ChinaWest China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, ChinaWest China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, ChinaWest China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, ChinaSchool of Nursing, Capital Medical University, Peking, ChinaAIM for Health Lab, Monash University, Melbourne, VIC, AustraliaWest China School of Public Health and West China Fourth Hospital, Sichuan University, Chengdu, ChinaGeneral Practice Ward, International Medical Center Ward, General Practice Medical Center, West China Hospital, Sichuan University, Chengdu, ChinaBackgroundInternet Addiction (IA) has emerged as a critical concern, especially among school age children and adolescents, potentially stalling their physical and mental development. Our study aimed to examine the risk factors associated with IA among Chinese children and adolescents and leverage explainable machine learning (ML) algorithms to predict IA status at the time of assessment, based on Young’s Internet Addiction Test.MethodsThe longitudinal data consisting of 8,824 schoolchildren from the Chengdu Positive Child Development (CPCD) survey were analyzed, where 33.3% of participants were identified with IA (Age: 10.97 ± 2.31, Male: 51.73%). IA was defined using Young’s Internet Addiction Test (IAT ≥ 40). Demographic variables such as age, gender, and grade level, along with key variables including scores of Cognitive Behavioral Competencies (CBC), Prosocial Attributes (PA), Positive Identity (PI), General Positive Youth Development Qualities (GPYDQ), Life Satisfaction (LS), Delinquent Behavior (DB), Non-Suicidal Self-Injury (NSSI), Depression (DP), Anxiety (AX), Family Function Disorders (FF), Egocentrism (EG), Empathy (EP), Academic Intrinsic Value (IV), and Academic Utility Value (UV) were examined. Chi-square and Mann–Whitney U tests were employed to validate the significance of the mentioned predictors of IA. We applied six ML models: Extra Random Forest, XGBoost, Logistic Regression, Bernoulli Naïve Bayes, Multi-Layer Perceptron (MLP), and Transformer Encoder. Performance was evaluated via 10-fold cross-validation and held-out test sets across survey waves. Feature selection and SHapley Additive exPlanations (SHAP) analysis were utilised for model improvement and interpretability, respectively.ResultsExtraRFC achieved the best performance (Test AUC = 0.854, Accuracy = 0.798, F1 = 0.659), outperforming all other models across most metrics and external validations. Key predictors included grade level, delinquent behavior, anxiety, family function, and depression scores. SHAP analysis revealed consistent and interpretable feature contributions across individuals.ConclusionDepression, anxiety, and family dynamics are significant factors influencing IA in children. The Extra Random Forest model proves most effective in predicting IA, emphasising the importance of addressing these factors to promote healthy digital habits in children. This study presents an effective SHAP-based explainable ML framework for IA prediction in children and adolescents.https://www.frontiersin.org/articles/10.3389/fpubh.2025.1590689/fullinternet addictionadolescent and childrenmachine learningextra random forestlongitudinal study
spellingShingle Jiahe Liu
Lang Chen
Yuxin Chen
Jingsong Luo
Kexin Yu
Linlin Fan
Chan Yong
Huiyu He
Simei Liao
Zongyuan Ge
Lihua Jiang
Lihua Jiang
Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)
Frontiers in Public Health
internet addiction
adolescent and children
machine learning
extra random forest
longitudinal study
title Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)
title_full Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)
title_fullStr Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)
title_full_unstemmed Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)
title_short Explainable machine learning prediction of internet addiction among Chinese primary and middle school children and adolescents: a longitudinal study based on positive youth development data (2019–2022)
title_sort explainable machine learning prediction of internet addiction among chinese primary and middle school children and adolescents a longitudinal study based on positive youth development data 2019 2022
topic internet addiction
adolescent and children
machine learning
extra random forest
longitudinal study
url https://www.frontiersin.org/articles/10.3389/fpubh.2025.1590689/full
work_keys_str_mv AT jiaheliu explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT langchen explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT yuxinchen explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT jingsongluo explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT kexinyu explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT linlinfan explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT chanyong explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT huiyuhe explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT simeiliao explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT zongyuange explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT lihuajiang explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022
AT lihuajiang explainablemachinelearningpredictionofinternetaddictionamongchineseprimaryandmiddleschoolchildrenandadolescentsalongitudinalstudybasedonpositiveyouthdevelopmentdata20192022