RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning

Cost-sensitive learning is a popular paradigm to address class-imbalance learning (CIL) problem. Traditional cost-sensitive learning approaches always solve CIL problem by assigning a constant higher training error penalty for all minority instances than that of majority instances, but ignore the si...

Full description

Saved in:
Bibliographic Details
Main Authors: Shanlin Zhou, Yan Gu, Hualong Yu, Xibei Yang, Shang Gao
Format: Article
Language:English
Published: Springer 2023-04-01
Series:Journal of King Saud University: Computer and Information Sciences
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1319157823000617
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849307459948118016
author Shanlin Zhou
Yan Gu
Hualong Yu
Xibei Yang
Shang Gao
author_facet Shanlin Zhou
Yan Gu
Hualong Yu
Xibei Yang
Shang Gao
author_sort Shanlin Zhou
collection DOAJ
description Cost-sensitive learning is a popular paradigm to address class-imbalance learning (CIL) problem. Traditional cost-sensitive learning approaches always solve CIL problem by assigning a constant higher training error penalty for all minority instances than that of majority instances, but ignore the significance of location information. Therefore, several recent studies began to focus on the personalized cost assignment, i.e., designating different costs for different instances based on their location information. The emerging personalized cost-sensitive approaches always perform better than those traditional ones; however, the estimation for location information may be inaccurate as it is apt to be impacted by data density variation. To address this problem, we propose a novel location information estimation and cost assignment strategy called RUE. Unlike previous approaches, our proposed strategy explores location information by an indirect way: the error rate feed backed from a random undersampling ensemble. The strategy is robust towards data distribution, and is helpful for accurately estimating the significance of each instance regardless the complexity of data distribution. In context of Fuzzy Support Vector Machine (FSVM) and Weighted Extreme Learning Machine (WELM), the proposed cost assignment strategy is compared with several popular and state-of-the-art approaches, and the results show its effectiveness and superiority.
format Article
id doaj-art-214d332866eb4a0197acdc86b80fb079
institution Kabale University
issn 1319-1578
language English
publishDate 2023-04-01
publisher Springer
record_format Article
series Journal of King Saud University: Computer and Information Sciences
spelling doaj-art-214d332866eb4a0197acdc86b80fb0792025-08-20T03:54:47ZengSpringerJournal of King Saud University: Computer and Information Sciences1319-15782023-04-01354364910.1016/j.jksuci.2023.03.001RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learningShanlin Zhou0Yan Gu1Hualong Yu2Xibei Yang3Shang Gao4School of Computer, Jiangsu University of Science and Technology, Zhenjiang, ChinaSchool of Computer, Jiangsu University of Science and Technology, Zhenjiang, ChinaCorresponding author. No.666, Changhui Road, Jiangsu University of Science and Technology, Zhenjiang, Jiangsu, China.; School of Computer, Jiangsu University of Science and Technology, Zhenjiang, ChinaSchool of Computer, Jiangsu University of Science and Technology, Zhenjiang, ChinaSchool of Computer, Jiangsu University of Science and Technology, Zhenjiang, ChinaCost-sensitive learning is a popular paradigm to address class-imbalance learning (CIL) problem. Traditional cost-sensitive learning approaches always solve CIL problem by assigning a constant higher training error penalty for all minority instances than that of majority instances, but ignore the significance of location information. Therefore, several recent studies began to focus on the personalized cost assignment, i.e., designating different costs for different instances based on their location information. The emerging personalized cost-sensitive approaches always perform better than those traditional ones; however, the estimation for location information may be inaccurate as it is apt to be impacted by data density variation. To address this problem, we propose a novel location information estimation and cost assignment strategy called RUE. Unlike previous approaches, our proposed strategy explores location information by an indirect way: the error rate feed backed from a random undersampling ensemble. The strategy is robust towards data distribution, and is helpful for accurately estimating the significance of each instance regardless the complexity of data distribution. In context of Fuzzy Support Vector Machine (FSVM) and Weighted Extreme Learning Machine (WELM), the proposed cost assignment strategy is compared with several popular and state-of-the-art approaches, and the results show its effectiveness and superiority.http://www.sciencedirect.com/science/article/pii/S1319157823000617Cost-sensitive learningClass imbalance learningRandom undersampling ensembleFuzzy support vector machineWeighted extreme learning machine
spellingShingle Shanlin Zhou
Yan Gu
Hualong Yu
Xibei Yang
Shang Gao
RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning
Journal of King Saud University: Computer and Information Sciences
Cost-sensitive learning
Class imbalance learning
Random undersampling ensemble
Fuzzy support vector machine
Weighted extreme learning machine
title RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning
title_full RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning
title_fullStr RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning
title_full_unstemmed RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning
title_short RUE: A robust personalized cost assignment strategy for class imbalance cost-sensitive learning
title_sort rue a robust personalized cost assignment strategy for class imbalance cost sensitive learning
topic Cost-sensitive learning
Class imbalance learning
Random undersampling ensemble
Fuzzy support vector machine
Weighted extreme learning machine
url http://www.sciencedirect.com/science/article/pii/S1319157823000617
work_keys_str_mv AT shanlinzhou ruearobustpersonalizedcostassignmentstrategyforclassimbalancecostsensitivelearning
AT yangu ruearobustpersonalizedcostassignmentstrategyforclassimbalancecostsensitivelearning
AT hualongyu ruearobustpersonalizedcostassignmentstrategyforclassimbalancecostsensitivelearning
AT xibeiyang ruearobustpersonalizedcostassignmentstrategyforclassimbalancecostsensitivelearning
AT shanggao ruearobustpersonalizedcostassignmentstrategyforclassimbalancecostsensitivelearning