Comparative analysis of boosting algorithms for predicting personal default

Accurately predicting personal default risk is crucial for financial institutions to manage credit risk effectively. This study conducts a comparative analysis of the performance of boosting algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost, in predicting personal defaults. The dataset...

Full description

Saved in:
Bibliographic Details
Main Authors: Nhat Nguyen, Duy Ngo
Format: Article
Language:English
Published: Taylor & Francis Group 2025-12-01
Series:Cogent Economics & Finance
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/23322039.2025.2465971
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850109066479665152
author Nhat Nguyen
Duy Ngo
author_facet Nhat Nguyen
Duy Ngo
author_sort Nhat Nguyen
collection DOAJ
description Accurately predicting personal default risk is crucial for financial institutions to manage credit risk effectively. This study conducts a comparative analysis of the performance of boosting algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost, in predicting personal defaults. The dataset used in the study comprises 7,542 individual customers collected from Vietnamese commercial banks and financial institutions between 2014 and 2022, with 12 features related to the financial and demographic characteristics of the borrowers. All customer-related information is fully anonymized and encrypted during the data collection process to ensure compliance with research ethics. The predictive models are evaluated based on six criteria: Accuracy, Precision, Sensitivity, Specificity, F1 score, and AUC. The results indicate that the LightGBM model has the best performance, demonstrating the ability to efficiently handle large and complex datasets. Additionally, the study identifies the five most significant factors influencing personal default risk: Monthly Liability, Credit Balance, Credit History Length, Max Credit Limit, and Yearly Income. However, the study’s limitations in the size and scope of the dataset may reduce the generalizability of the results when applied to other regions. These findings provide valuable insights that help financial institutions enhance their strategies for managing credit risk effectively.
format Article
id doaj-art-4af44f1abdb640b2b4e5964cc3440930
institution OA Journals
issn 2332-2039
language English
publishDate 2025-12-01
publisher Taylor & Francis Group
record_format Article
series Cogent Economics & Finance
spelling doaj-art-4af44f1abdb640b2b4e5964cc34409302025-08-20T02:38:11ZengTaylor & Francis GroupCogent Economics & Finance2332-20392025-12-0113110.1080/23322039.2025.2465971Comparative analysis of boosting algorithms for predicting personal defaultNhat Nguyen0Duy Ngo1Ho Chi Minh University of Banking, Ho Chi Minh, VietnamHo Chi Minh University of Banking, Ho Chi Minh, VietnamAccurately predicting personal default risk is crucial for financial institutions to manage credit risk effectively. This study conducts a comparative analysis of the performance of boosting algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost, in predicting personal defaults. The dataset used in the study comprises 7,542 individual customers collected from Vietnamese commercial banks and financial institutions between 2014 and 2022, with 12 features related to the financial and demographic characteristics of the borrowers. All customer-related information is fully anonymized and encrypted during the data collection process to ensure compliance with research ethics. The predictive models are evaluated based on six criteria: Accuracy, Precision, Sensitivity, Specificity, F1 score, and AUC. The results indicate that the LightGBM model has the best performance, demonstrating the ability to efficiently handle large and complex datasets. Additionally, the study identifies the five most significant factors influencing personal default risk: Monthly Liability, Credit Balance, Credit History Length, Max Credit Limit, and Yearly Income. However, the study’s limitations in the size and scope of the dataset may reduce the generalizability of the results when applied to other regions. These findings provide valuable insights that help financial institutions enhance their strategies for managing credit risk effectively.https://www.tandfonline.com/doi/10.1080/23322039.2025.2465971Personal default predictionboosting algorithmsfeature importance analysismachine learningfinancial institutionsCredit & Credit Institutions
spellingShingle Nhat Nguyen
Duy Ngo
Comparative analysis of boosting algorithms for predicting personal default
Cogent Economics & Finance
Personal default prediction
boosting algorithms
feature importance analysis
machine learning
financial institutions
Credit & Credit Institutions
title Comparative analysis of boosting algorithms for predicting personal default
title_full Comparative analysis of boosting algorithms for predicting personal default
title_fullStr Comparative analysis of boosting algorithms for predicting personal default
title_full_unstemmed Comparative analysis of boosting algorithms for predicting personal default
title_short Comparative analysis of boosting algorithms for predicting personal default
title_sort comparative analysis of boosting algorithms for predicting personal default
topic Personal default prediction
boosting algorithms
feature importance analysis
machine learning
financial institutions
Credit & Credit Institutions
url https://www.tandfonline.com/doi/10.1080/23322039.2025.2465971
work_keys_str_mv AT nhatnguyen comparativeanalysisofboostingalgorithmsforpredictingpersonaldefault
AT duyngo comparativeanalysisofboostingalgorithmsforpredictingpersonaldefault