Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation

Abstract Machine learning methods are widely used to evaluate the risk of small- and medium-sized enterprises (SMEs) in supply chain finance (SCF). However, there may be problems with data scarcity, feature redundancy, and poor predictive performance. Additionally, data collected over a long time sp...

Full description

Saved in:
Bibliographic Details
Main Authors: Xiaoming Zhang, Lean Yu, Hang Yin
Format: Article
Language:English
Published: SpringerOpen 2025-01-01
Series:Financial Innovation
Subjects:
Online Access:https://doi.org/10.1186/s40854-024-00695-3
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841544397263470592
author Xiaoming Zhang
Lean Yu
Hang Yin
author_facet Xiaoming Zhang
Lean Yu
Hang Yin
author_sort Xiaoming Zhang
collection DOAJ
description Abstract Machine learning methods are widely used to evaluate the risk of small- and medium-sized enterprises (SMEs) in supply chain finance (SCF). However, there may be problems with data scarcity, feature redundancy, and poor predictive performance. Additionally, data collected over a long time span may cause differences in the data distribution, and classic supervised learning methods may exhibit poor predictive abilities under such conditions. To address these issues, a domain-adaptation-based multistage ensemble learning paradigm (DAMEL) is proposed in this study to evaluate the credit risk of SMEs in SCF. In this methodology, a bagging resampling algorithm is first used to generate a dataset to address data scarcity. Subsequently, a random subspace is applied to integrate various features and reduce feature redundancy. Additionally, a domain adaptation approach is utilized to reduce the data distribution discrepancy in the cross-domain. Finally, dynamic model selection is developed to improve the generalization ability of the model in the fourth stage. A real-world credit dataset from the Chinese securities market was used to validate the effectiveness and feasibility of the multistage ensemble learning paradigm. The experimental results demonstrated that the proposed domain-adaptation-based multistage ensemble learning paradigm is superior to principal component analysis, joint distribution adaptation, random forest, and other ensemble and transfer learning methods. Moreover, dynamic model selection can improve the model generalization performance and prediction precision of minority samples. This can be considered a promising solution for evaluating the credit risk of SMEs in SCF for financial institutions.
format Article
id doaj-art-ea105e7818384d76971da11d37b8405b
institution Kabale University
issn 2199-4730
language English
publishDate 2025-01-01
publisher SpringerOpen
record_format Article
series Financial Innovation
spelling doaj-art-ea105e7818384d76971da11d37b8405b2025-01-12T12:36:15ZengSpringerOpenFinancial Innovation2199-47302025-01-0111112810.1186/s40854-024-00695-3Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluationXiaoming Zhang0Lean Yu1Hang Yin2School of Information Management and Mathematics, Jiangxi University of Finance and EconomicsBusiness School, Sichuan UniversitySchool of Economics and Management, Harbin Engineering UniversityAbstract Machine learning methods are widely used to evaluate the risk of small- and medium-sized enterprises (SMEs) in supply chain finance (SCF). However, there may be problems with data scarcity, feature redundancy, and poor predictive performance. Additionally, data collected over a long time span may cause differences in the data distribution, and classic supervised learning methods may exhibit poor predictive abilities under such conditions. To address these issues, a domain-adaptation-based multistage ensemble learning paradigm (DAMEL) is proposed in this study to evaluate the credit risk of SMEs in SCF. In this methodology, a bagging resampling algorithm is first used to generate a dataset to address data scarcity. Subsequently, a random subspace is applied to integrate various features and reduce feature redundancy. Additionally, a domain adaptation approach is utilized to reduce the data distribution discrepancy in the cross-domain. Finally, dynamic model selection is developed to improve the generalization ability of the model in the fourth stage. A real-world credit dataset from the Chinese securities market was used to validate the effectiveness and feasibility of the multistage ensemble learning paradigm. The experimental results demonstrated that the proposed domain-adaptation-based multistage ensemble learning paradigm is superior to principal component analysis, joint distribution adaptation, random forest, and other ensemble and transfer learning methods. Moreover, dynamic model selection can improve the model generalization performance and prediction precision of minority samples. This can be considered a promising solution for evaluating the credit risk of SMEs in SCF for financial institutions.https://doi.org/10.1186/s40854-024-00695-3Joint distribution adaptationEnsemble learningSupply chain financeSmall and medium-sized enterprisesCredit risk evaluation
spellingShingle Xiaoming Zhang
Lean Yu
Hang Yin
Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation
Financial Innovation
Joint distribution adaptation
Ensemble learning
Supply chain finance
Small and medium-sized enterprises
Credit risk evaluation
title Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation
title_full Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation
title_fullStr Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation
title_full_unstemmed Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation
title_short Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation
title_sort domain adaptation based multistage ensemble learning paradigm for credit risk evaluation
topic Joint distribution adaptation
Ensemble learning
Supply chain finance
Small and medium-sized enterprises
Credit risk evaluation
url https://doi.org/10.1186/s40854-024-00695-3
work_keys_str_mv AT xiaomingzhang domainadaptationbasedmultistageensemblelearningparadigmforcreditriskevaluation
AT leanyu domainadaptationbasedmultistageensemblelearningparadigmforcreditriskevaluation
AT hangyin domainadaptationbasedmultistageensemblelearningparadigmforcreditriskevaluation