Comparative analysis of boosting algorithms for predicting personal default

Accurately predicting personal default risk is crucial for financial institutions to manage credit risk effectively. This study conducts a comparative analysis of the performance of boosting algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost, in predicting personal defaults. The dataset...

Full description

Saved in:

Bibliographic Details
Main Authors:	Nhat Nguyen, Duy Ngo
Format:	Article
Language:	English
Published:	Taylor & Francis Group 2025-12-01
Series:	Cogent Economics & Finance
Subjects:	Personal default prediction boosting algorithms feature importance analysis machine learning financial institutions Credit & Credit Institutions
Online Access:	https://www.tandfonline.com/doi/10.1080/23322039.2025.2465971
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850109066479665152
author	Nhat Nguyen Duy Ngo
author_facet	Nhat Nguyen Duy Ngo
author_sort	Nhat Nguyen
collection	DOAJ
description	Accurately predicting personal default risk is crucial for financial institutions to manage credit risk effectively. This study conducts a comparative analysis of the performance of boosting algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost, in predicting personal defaults. The dataset used in the study comprises 7,542 individual customers collected from Vietnamese commercial banks and financial institutions between 2014 and 2022, with 12 features related to the financial and demographic characteristics of the borrowers. All customer-related information is fully anonymized and encrypted during the data collection process to ensure compliance with research ethics. The predictive models are evaluated based on six criteria: Accuracy, Precision, Sensitivity, Specificity, F1 score, and AUC. The results indicate that the LightGBM model has the best performance, demonstrating the ability to efficiently handle large and complex datasets. Additionally, the study identifies the five most significant factors influencing personal default risk: Monthly Liability, Credit Balance, Credit History Length, Max Credit Limit, and Yearly Income. However, the study’s limitations in the size and scope of the dataset may reduce the generalizability of the results when applied to other regions. These findings provide valuable insights that help financial institutions enhance their strategies for managing credit risk effectively.
format	Article
id	doaj-art-4af44f1abdb640b2b4e5964cc3440930
institution	OA Journals
issn	2332-2039
language	English
publishDate	2025-12-01
publisher	Taylor & Francis Group
record_format	Article
series	Cogent Economics & Finance
spelling	doaj-art-4af44f1abdb640b2b4e5964cc34409302025-08-20T02:38:11ZengTaylor & Francis GroupCogent Economics & Finance2332-20392025-12-0113110.1080/23322039.2025.2465971Comparative analysis of boosting algorithms for predicting personal defaultNhat Nguyen0Duy Ngo1Ho Chi Minh University of Banking, Ho Chi Minh, VietnamHo Chi Minh University of Banking, Ho Chi Minh, VietnamAccurately predicting personal default risk is crucial for financial institutions to manage credit risk effectively. This study conducts a comparative analysis of the performance of boosting algorithms, including AdaBoost, XGBoost, LightGBM, and CatBoost, in predicting personal defaults. The dataset used in the study comprises 7,542 individual customers collected from Vietnamese commercial banks and financial institutions between 2014 and 2022, with 12 features related to the financial and demographic characteristics of the borrowers. All customer-related information is fully anonymized and encrypted during the data collection process to ensure compliance with research ethics. The predictive models are evaluated based on six criteria: Accuracy, Precision, Sensitivity, Specificity, F1 score, and AUC. The results indicate that the LightGBM model has the best performance, demonstrating the ability to efficiently handle large and complex datasets. Additionally, the study identifies the five most significant factors influencing personal default risk: Monthly Liability, Credit Balance, Credit History Length, Max Credit Limit, and Yearly Income. However, the study’s limitations in the size and scope of the dataset may reduce the generalizability of the results when applied to other regions. These findings provide valuable insights that help financial institutions enhance their strategies for managing credit risk effectively.https://www.tandfonline.com/doi/10.1080/23322039.2025.2465971Personal default predictionboosting algorithmsfeature importance analysismachine learningfinancial institutionsCredit & Credit Institutions
spellingShingle	Nhat Nguyen Duy Ngo Comparative analysis of boosting algorithms for predicting personal default Cogent Economics & Finance Personal default prediction boosting algorithms feature importance analysis machine learning financial institutions Credit & Credit Institutions
title	Comparative analysis of boosting algorithms for predicting personal default
title_full	Comparative analysis of boosting algorithms for predicting personal default
title_fullStr	Comparative analysis of boosting algorithms for predicting personal default
title_full_unstemmed	Comparative analysis of boosting algorithms for predicting personal default
title_short	Comparative analysis of boosting algorithms for predicting personal default
title_sort	comparative analysis of boosting algorithms for predicting personal default
topic	Personal default prediction boosting algorithms feature importance analysis machine learning financial institutions Credit & Credit Institutions
url	https://www.tandfonline.com/doi/10.1080/23322039.2025.2465971
work_keys_str_mv	AT nhatnguyen comparativeanalysisofboostingalgorithmsforpredictingpersonaldefault AT duyngo comparativeanalysisofboostingalgorithmsforpredictingpersonaldefault

Comparative analysis of boosting algorithms for predicting personal default

Similar Items