Model interpretability on private-safe oriented student dropout prediction.

Student dropout is a significant social issue with extensive implications for individuals and society, including reduced employability and economic downturns, which, in turn, drastically influence social sustainable development. Identifying students at high risk of dropping out is a major challenge...

Full description

Saved in:
Bibliographic Details
Main Authors: Helai Liu, Mao Mao, Xia Li, Jia Gao
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0317726
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850216176340172800
author Helai Liu
Mao Mao
Xia Li
Jia Gao
author_facet Helai Liu
Mao Mao
Xia Li
Jia Gao
author_sort Helai Liu
collection DOAJ
description Student dropout is a significant social issue with extensive implications for individuals and society, including reduced employability and economic downturns, which, in turn, drastically influence social sustainable development. Identifying students at high risk of dropping out is a major challenge for sustainable education. While existing machine learning and deep learning models can effectively predict dropout risks, they often rely on real student data, raising ethical concerns and the risk of information leakage. Additionally, the poor interpretability of these models complicates their use in educational management, as it is difficult to justify identifying a student as high-risk based on an opaque model. To address these two issues, we introduced for the first time a modified Preprocessed Kernel Inducing Points data distillation technique (PP-KIPDD), specializing in distilling tabular structured dataset, and innovatively employed the PP-KIPDD to reconstruct new samples that serve as qualified training sets simulating student information distributions, thereby preventing student privacy information leakage, which showed better performance and efficiency compared to traditional data synthesis techniques such as the Conditional Generative Adversarial Networks. Furthermore, we empower the classifiers credibility by enhancing model interpretability utilized SHAP (SHapley Additive exPlanations) values and elucidated the significance of selected features from an educational management perspective. With well-explained features from both quantitative and qualitative aspects, our approach enhances the feasibility and reasonableness of dropout predictions using machine learning techniques. We believe our approach represents a novel end-to-end framework of artificial intelligence application in the field of sustainable education management from the view of decision-makers, as it addresses privacy leakage protection and enhances model credibility for practical management implementations.
format Article
id doaj-art-ffaeb3f4a6ca4246b83a8d807b30ec8f
institution OA Journals
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-ffaeb3f4a6ca4246b83a8d807b30ec8f2025-08-20T02:08:23ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01203e031772610.1371/journal.pone.0317726Model interpretability on private-safe oriented student dropout prediction.Helai LiuMao MaoXia LiJia GaoStudent dropout is a significant social issue with extensive implications for individuals and society, including reduced employability and economic downturns, which, in turn, drastically influence social sustainable development. Identifying students at high risk of dropping out is a major challenge for sustainable education. While existing machine learning and deep learning models can effectively predict dropout risks, they often rely on real student data, raising ethical concerns and the risk of information leakage. Additionally, the poor interpretability of these models complicates their use in educational management, as it is difficult to justify identifying a student as high-risk based on an opaque model. To address these two issues, we introduced for the first time a modified Preprocessed Kernel Inducing Points data distillation technique (PP-KIPDD), specializing in distilling tabular structured dataset, and innovatively employed the PP-KIPDD to reconstruct new samples that serve as qualified training sets simulating student information distributions, thereby preventing student privacy information leakage, which showed better performance and efficiency compared to traditional data synthesis techniques such as the Conditional Generative Adversarial Networks. Furthermore, we empower the classifiers credibility by enhancing model interpretability utilized SHAP (SHapley Additive exPlanations) values and elucidated the significance of selected features from an educational management perspective. With well-explained features from both quantitative and qualitative aspects, our approach enhances the feasibility and reasonableness of dropout predictions using machine learning techniques. We believe our approach represents a novel end-to-end framework of artificial intelligence application in the field of sustainable education management from the view of decision-makers, as it addresses privacy leakage protection and enhances model credibility for practical management implementations.https://doi.org/10.1371/journal.pone.0317726
spellingShingle Helai Liu
Mao Mao
Xia Li
Jia Gao
Model interpretability on private-safe oriented student dropout prediction.
PLoS ONE
title Model interpretability on private-safe oriented student dropout prediction.
title_full Model interpretability on private-safe oriented student dropout prediction.
title_fullStr Model interpretability on private-safe oriented student dropout prediction.
title_full_unstemmed Model interpretability on private-safe oriented student dropout prediction.
title_short Model interpretability on private-safe oriented student dropout prediction.
title_sort model interpretability on private safe oriented student dropout prediction
url https://doi.org/10.1371/journal.pone.0317726
work_keys_str_mv AT helailiu modelinterpretabilityonprivatesafeorientedstudentdropoutprediction
AT maomao modelinterpretabilityonprivatesafeorientedstudentdropoutprediction
AT xiali modelinterpretabilityonprivatesafeorientedstudentdropoutprediction
AT jiagao modelinterpretabilityonprivatesafeorientedstudentdropoutprediction