Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM

Abstract This paper proposes a novel approach for transformer fault diagnosis. Initially, a high-dimensional feature set comprising 19 features related to five gas concentrations is constructed to reflect the gas-fault relationship. Subsequently, the Shapley Additive Explanations (SHAP) method is em...

Full description

Saved in:
Bibliographic Details
Main Authors: Cheng Liu, Weiming Yang
Format: Article
Language:English
Published: SpringerOpen 2025-04-01
Series:Energy Informatics
Subjects:
Online Access:https://doi.org/10.1186/s42162-025-00519-3
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850172387920707584
author Cheng Liu
Weiming Yang
author_facet Cheng Liu
Weiming Yang
author_sort Cheng Liu
collection DOAJ
description Abstract This paper proposes a novel approach for transformer fault diagnosis. Initially, a high-dimensional feature set comprising 19 features related to five gas concentrations is constructed to reflect the gas-fault relationship. Subsequently, the Shapley Additive Explanations (SHAP) method is employed to evaluate feature importance and select a subset that significantly influences model predictions, thereby simplifying the model and enhancing its interpretability. Following this, the bald eagle search (BES) intelligent optimization algorithm is utilized to optimize the hyperparameters of the light gradient boosting machine (LGBM) model, further improving its predictive capability. Comparative experiments with various traditional machine learning models validate the effectiveness of the proposed method. The SHAP-BES-LGBM model achieves the highest accuracy of 0.9509 and an f1 score of 0.9606 on the test set, with only 11 samples misclassified, demonstrating superior classification performance and underscoring the advantages of this integrated approach in transformer fault diagnosis.
format Article
id doaj-art-35e4561ca1f04d2f91e9f94343f1bcc7
institution OA Journals
issn 2520-8942
language English
publishDate 2025-04-01
publisher SpringerOpen
record_format Article
series Energy Informatics
spelling doaj-art-35e4561ca1f04d2f91e9f94343f1bcc72025-08-20T02:20:06ZengSpringerOpenEnergy Informatics2520-89422025-04-018111910.1186/s42162-025-00519-3Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBMCheng Liu0Weiming Yang1Department of Digital Business, Jiangsu Vocational Institute of CommerceDepartment of Digital Business, Jiangsu Vocational Institute of CommerceAbstract This paper proposes a novel approach for transformer fault diagnosis. Initially, a high-dimensional feature set comprising 19 features related to five gas concentrations is constructed to reflect the gas-fault relationship. Subsequently, the Shapley Additive Explanations (SHAP) method is employed to evaluate feature importance and select a subset that significantly influences model predictions, thereby simplifying the model and enhancing its interpretability. Following this, the bald eagle search (BES) intelligent optimization algorithm is utilized to optimize the hyperparameters of the light gradient boosting machine (LGBM) model, further improving its predictive capability. Comparative experiments with various traditional machine learning models validate the effectiveness of the proposed method. The SHAP-BES-LGBM model achieves the highest accuracy of 0.9509 and an f1 score of 0.9606 on the test set, with only 11 samples misclassified, demonstrating superior classification performance and underscoring the advantages of this integrated approach in transformer fault diagnosis.https://doi.org/10.1186/s42162-025-00519-3Transformer fault diagnosisLight gradient boosting machineIntelligent optimization algorithmsShapley additive explanations
spellingShingle Cheng Liu
Weiming Yang
Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM
Energy Informatics
Transformer fault diagnosis
Light gradient boosting machine
Intelligent optimization algorithms
Shapley additive explanations
title Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM
title_full Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM
title_fullStr Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM
title_full_unstemmed Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM
title_short Transformer fault diagnosis using machine learning: a method combining SHAP feature selection and intelligent optimization of LGBM
title_sort transformer fault diagnosis using machine learning a method combining shap feature selection and intelligent optimization of lgbm
topic Transformer fault diagnosis
Light gradient boosting machine
Intelligent optimization algorithms
Shapley additive explanations
url https://doi.org/10.1186/s42162-025-00519-3
work_keys_str_mv AT chengliu transformerfaultdiagnosisusingmachinelearningamethodcombiningshapfeatureselectionandintelligentoptimizationoflgbm
AT weimingyang transformerfaultdiagnosisusingmachinelearningamethodcombiningshapfeatureselectionandintelligentoptimizationoflgbm