Research on text sentiment classification based on improved feature selection method
An improved information gain feature selection method based on sentiment dictionary was proposed.Firstly,aiming at the existing problems of information gain feature selection,such as paying attention to the frequency of feature word and ignoring the balance of corpus,an improved method was proposed....
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Language: | zho |
Published: |
Beijing Xintong Media Co., Ltd
2018-10-01
|
Series: | Dianxin kexue |
Subjects: | |
Online Access: | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018250/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1841530477463207936 |
---|---|
author | Mingxin LIU Jing CHEN Qiyuan WANG |
author_facet | Mingxin LIU Jing CHEN Qiyuan WANG |
author_sort | Mingxin LIU |
collection | DOAJ |
description | An improved information gain feature selection method based on sentiment dictionary was proposed.Firstly,aiming at the existing problems of information gain feature selection,such as paying attention to the frequency of feature word and ignoring the balance of corpus,an improved method was proposed.Secondly,considering the influence of sentiment words in text classification,a feature selection method IGSC (information gain combining sentiment classification) based on sentiment dictionary was proposed for text classification.By matching the text emotion words and combining the weight of emotion words,the feature dimension reduction was realized and the problem of text data sparseness affecting classification performance was solved.Finally,according to the proposed feature selection method of travel review data set for experimental verification and analysis,the experimental results show that the improved text sentiment classification feature selection method has been improved in terms of classification accuracy,recall and F value,and classification has better stability. |
format | Article |
id | doaj-art-caa05b5f1b09456880986eef62e2d535 |
institution | Kabale University |
issn | 1000-0801 |
language | zho |
publishDate | 2018-10-01 |
publisher | Beijing Xintong Media Co., Ltd |
record_format | Article |
series | Dianxin kexue |
spelling | doaj-art-caa05b5f1b09456880986eef62e2d5352025-01-15T03:03:57ZzhoBeijing Xintong Media Co., LtdDianxin kexue1000-08012018-10-0134859559593414Research on text sentiment classification based on improved feature selection methodMingxin LIUJing CHENQiyuan WANGAn improved information gain feature selection method based on sentiment dictionary was proposed.Firstly,aiming at the existing problems of information gain feature selection,such as paying attention to the frequency of feature word and ignoring the balance of corpus,an improved method was proposed.Secondly,considering the influence of sentiment words in text classification,a feature selection method IGSC (information gain combining sentiment classification) based on sentiment dictionary was proposed for text classification.By matching the text emotion words and combining the weight of emotion words,the feature dimension reduction was realized and the problem of text data sparseness affecting classification performance was solved.Finally,according to the proposed feature selection method of travel review data set for experimental verification and analysis,the experimental results show that the improved text sentiment classification feature selection method has been improved in terms of classification accuracy,recall and F value,and classification has better stability.http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018250/information gainsentiment dictionaryfeature selectionsentiment classification |
spellingShingle | Mingxin LIU Jing CHEN Qiyuan WANG Research on text sentiment classification based on improved feature selection method Dianxin kexue information gain sentiment dictionary feature selection sentiment classification |
title | Research on text sentiment classification based on improved feature selection method |
title_full | Research on text sentiment classification based on improved feature selection method |
title_fullStr | Research on text sentiment classification based on improved feature selection method |
title_full_unstemmed | Research on text sentiment classification based on improved feature selection method |
title_short | Research on text sentiment classification based on improved feature selection method |
title_sort | research on text sentiment classification based on improved feature selection method |
topic | information gain sentiment dictionary feature selection sentiment classification |
url | http://www.telecomsci.com/zh/article/doi/10.11959/j.issn.1000-0801.2018250/ |
work_keys_str_mv | AT mingxinliu researchontextsentimentclassificationbasedonimprovedfeatureselectionmethod AT jingchen researchontextsentimentclassificationbasedonimprovedfeatureselectionmethod AT qiyuanwang researchontextsentimentclassificationbasedonimprovedfeatureselectionmethod |