Application of Improved MDSMOTE and FC-SVM in Imbalanced Data Set Classification

On the network shopping evaluation data sets appear the phenomenon of extreme imbalance,inorder to improve the classification accuracy of the unbalanced data set,It should be improved from both the sample and the algorithm For one of the problem in MDSMOTE algorithm that when generating part of the...

Full description

Saved in:
Bibliographic Details
Main Authors: WEN Xue-yan, ZHAO Li-ying, XU Ke-sheng, LU Guang
Format: Article
Language:zho
Published: Harbin University of Science and Technology Publications 2018-08-01
Series:Journal of Harbin University of Science and Technology
Subjects:
Online Access:https://hlgxb.hrbust.edu.cn/#/digest?ArticleID=1561
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:On the network shopping evaluation data sets appear the phenomenon of extreme imbalance,inorder to improve the classification accuracy of the unbalanced data set,It should be improved from both the sample and the algorithm For one of the problem in MDSMOTE algorithm that when generating part of the new samples, wrong points sample can't be contained,the correct classification of the wrongly classified sample is added to the existing MDSMOTE algorithm to improve the quality of the samples. For that we can' t solve the problem of the hyper plane bias of the minority class in traditional FSVM on imbalanced data sets classification,positive and negative penalty coefficient and fuzzy factor are added the FSVM to improve the recognition rate of unbalanced data. The improved algorithm is used in the classification of JingDong online shopping commentary data set. The f- measure value of this algorithm is increased by 9. 13% on average,which indicates the feasibility and effectiveness of this method.
ISSN:1007-2683