Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation
Abstract Machine learning methods are widely used to evaluate the risk of small- and medium-sized enterprises (SMEs) in supply chain finance (SCF). However, there may be problems with data scarcity, feature redundancy, and poor predictive performance. Additionally, data collected over a long time sp...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | English |
Published: |
SpringerOpen
2025-01-01
|
Series: | Financial Innovation |
Subjects: | |
Online Access: | https://doi.org/10.1186/s40854-024-00695-3 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841544397263470592 |
---|---|
author | Xiaoming Zhang Lean Yu Hang Yin |
author_facet | Xiaoming Zhang Lean Yu Hang Yin |
author_sort | Xiaoming Zhang |
collection | DOAJ |
description | Abstract Machine learning methods are widely used to evaluate the risk of small- and medium-sized enterprises (SMEs) in supply chain finance (SCF). However, there may be problems with data scarcity, feature redundancy, and poor predictive performance. Additionally, data collected over a long time span may cause differences in the data distribution, and classic supervised learning methods may exhibit poor predictive abilities under such conditions. To address these issues, a domain-adaptation-based multistage ensemble learning paradigm (DAMEL) is proposed in this study to evaluate the credit risk of SMEs in SCF. In this methodology, a bagging resampling algorithm is first used to generate a dataset to address data scarcity. Subsequently, a random subspace is applied to integrate various features and reduce feature redundancy. Additionally, a domain adaptation approach is utilized to reduce the data distribution discrepancy in the cross-domain. Finally, dynamic model selection is developed to improve the generalization ability of the model in the fourth stage. A real-world credit dataset from the Chinese securities market was used to validate the effectiveness and feasibility of the multistage ensemble learning paradigm. The experimental results demonstrated that the proposed domain-adaptation-based multistage ensemble learning paradigm is superior to principal component analysis, joint distribution adaptation, random forest, and other ensemble and transfer learning methods. Moreover, dynamic model selection can improve the model generalization performance and prediction precision of minority samples. This can be considered a promising solution for evaluating the credit risk of SMEs in SCF for financial institutions. |
format | Article |
id | doaj-art-ea105e7818384d76971da11d37b8405b |
institution | Kabale University |
issn | 2199-4730 |
language | English |
publishDate | 2025-01-01 |
publisher | SpringerOpen |
record_format | Article |
series | Financial Innovation |
spelling | doaj-art-ea105e7818384d76971da11d37b8405b2025-01-12T12:36:15ZengSpringerOpenFinancial Innovation2199-47302025-01-0111112810.1186/s40854-024-00695-3Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluationXiaoming Zhang0Lean Yu1Hang Yin2School of Information Management and Mathematics, Jiangxi University of Finance and EconomicsBusiness School, Sichuan UniversitySchool of Economics and Management, Harbin Engineering UniversityAbstract Machine learning methods are widely used to evaluate the risk of small- and medium-sized enterprises (SMEs) in supply chain finance (SCF). However, there may be problems with data scarcity, feature redundancy, and poor predictive performance. Additionally, data collected over a long time span may cause differences in the data distribution, and classic supervised learning methods may exhibit poor predictive abilities under such conditions. To address these issues, a domain-adaptation-based multistage ensemble learning paradigm (DAMEL) is proposed in this study to evaluate the credit risk of SMEs in SCF. In this methodology, a bagging resampling algorithm is first used to generate a dataset to address data scarcity. Subsequently, a random subspace is applied to integrate various features and reduce feature redundancy. Additionally, a domain adaptation approach is utilized to reduce the data distribution discrepancy in the cross-domain. Finally, dynamic model selection is developed to improve the generalization ability of the model in the fourth stage. A real-world credit dataset from the Chinese securities market was used to validate the effectiveness and feasibility of the multistage ensemble learning paradigm. The experimental results demonstrated that the proposed domain-adaptation-based multistage ensemble learning paradigm is superior to principal component analysis, joint distribution adaptation, random forest, and other ensemble and transfer learning methods. Moreover, dynamic model selection can improve the model generalization performance and prediction precision of minority samples. This can be considered a promising solution for evaluating the credit risk of SMEs in SCF for financial institutions.https://doi.org/10.1186/s40854-024-00695-3Joint distribution adaptationEnsemble learningSupply chain financeSmall and medium-sized enterprisesCredit risk evaluation |
spellingShingle | Xiaoming Zhang Lean Yu Hang Yin Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation Financial Innovation Joint distribution adaptation Ensemble learning Supply chain finance Small and medium-sized enterprises Credit risk evaluation |
title | Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation |
title_full | Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation |
title_fullStr | Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation |
title_full_unstemmed | Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation |
title_short | Domain adaptation-based multistage ensemble learning paradigm for credit risk evaluation |
title_sort | domain adaptation based multistage ensemble learning paradigm for credit risk evaluation |
topic | Joint distribution adaptation Ensemble learning Supply chain finance Small and medium-sized enterprises Credit risk evaluation |
url | https://doi.org/10.1186/s40854-024-00695-3 |
work_keys_str_mv | AT xiaomingzhang domainadaptationbasedmultistageensemblelearningparadigmforcreditriskevaluation AT leanyu domainadaptationbasedmultistageensemblelearningparadigmforcreditriskevaluation AT hangyin domainadaptationbasedmultistageensemblelearningparadigmforcreditriskevaluation |