Comparative Analysis of Several Models for Churning Customer Prediction

Customer churn prediction is critical for financial institutions to retain clients and optimize resource allocation. It is less expensive to keep current clients than to find new ones. There lots of research in this field, but their performance is often limited by data imbalance issues. This study c...

Full description

Saved in:
Bibliographic Details
Main Author: Tan Zhaoyuan
Format: Article
Language:English
Published: EDP Sciences 2025-01-01
Series:SHS Web of Conferences
Online Access:https://www.shs-conferences.org/articles/shsconf/pdf/2025/09/shsconf_icdde2025_02013.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849425494623125504
author Tan Zhaoyuan
author_facet Tan Zhaoyuan
author_sort Tan Zhaoyuan
collection DOAJ
description Customer churn prediction is critical for financial institutions to retain clients and optimize resource allocation. It is less expensive to keep current clients than to find new ones. There lots of research in this field, but their performance is often limited by data imbalance issues. This study compares three machine learning models: Random Forest, XGBoost Classifier, and Light Gradient Boosting Machine Classifier for predicting credit card customer churn using a dataset from Kaggle. The research addresses data imbalance issues through oversampling techniques (SMOTE, SMOTEENN, Borderline SMOTE) and evaluates model performance using accuracy and F1 score. Results show that the LGBM Classifier with Borderline SMOTE achieves the highest accuracy (97.43%) and F1 score (0.9259), outperforming other methods. This approach effectively balances precision and recall, improving minority class prediction. These findings provide actionable insights for financial institutions to implement proactive retention strategies. There are still limitations and future work to do. More different datasets, updated models for small datasets, and more feature engineering methods should be taken into consideration.
format Article
id doaj-art-e6ee57be223f4e44a6826c7124b84d3a
institution Kabale University
issn 2261-2424
language English
publishDate 2025-01-01
publisher EDP Sciences
record_format Article
series SHS Web of Conferences
spelling doaj-art-e6ee57be223f4e44a6826c7124b84d3a2025-08-20T03:29:44ZengEDP SciencesSHS Web of Conferences2261-24242025-01-012180201310.1051/shsconf/202521802013shsconf_icdde2025_02013Comparative Analysis of Several Models for Churning Customer PredictionTan Zhaoyuan0School of Finance, Tianjin University of Finance and EconomicsCustomer churn prediction is critical for financial institutions to retain clients and optimize resource allocation. It is less expensive to keep current clients than to find new ones. There lots of research in this field, but their performance is often limited by data imbalance issues. This study compares three machine learning models: Random Forest, XGBoost Classifier, and Light Gradient Boosting Machine Classifier for predicting credit card customer churn using a dataset from Kaggle. The research addresses data imbalance issues through oversampling techniques (SMOTE, SMOTEENN, Borderline SMOTE) and evaluates model performance using accuracy and F1 score. Results show that the LGBM Classifier with Borderline SMOTE achieves the highest accuracy (97.43%) and F1 score (0.9259), outperforming other methods. This approach effectively balances precision and recall, improving minority class prediction. These findings provide actionable insights for financial institutions to implement proactive retention strategies. There are still limitations and future work to do. More different datasets, updated models for small datasets, and more feature engineering methods should be taken into consideration.https://www.shs-conferences.org/articles/shsconf/pdf/2025/09/shsconf_icdde2025_02013.pdf
spellingShingle Tan Zhaoyuan
Comparative Analysis of Several Models for Churning Customer Prediction
SHS Web of Conferences
title Comparative Analysis of Several Models for Churning Customer Prediction
title_full Comparative Analysis of Several Models for Churning Customer Prediction
title_fullStr Comparative Analysis of Several Models for Churning Customer Prediction
title_full_unstemmed Comparative Analysis of Several Models for Churning Customer Prediction
title_short Comparative Analysis of Several Models for Churning Customer Prediction
title_sort comparative analysis of several models for churning customer prediction
url https://www.shs-conferences.org/articles/shsconf/pdf/2025/09/shsconf_icdde2025_02013.pdf
work_keys_str_mv AT tanzhaoyuan comparativeanalysisofseveralmodelsforchurningcustomerprediction