TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE

The development of the Internet has increased the need for daily online information storage. Finding the correct information that we are interested in takes a lot of time, so the use of techniques for organizing and processing text data are needed. These techniques are called text classification or...

Full description

Saved in:
Bibliographic Details
Main Author: Lê Thị Minh Nguyện
Format: Article
Language:English
Published: Dalat University 2019-06-01
Series:Tạp chí Khoa học Đại học Đà Lạt
Subjects:
Online Access:http://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832572525591134208
author Lê Thị Minh Nguyện
author_facet Lê Thị Minh Nguyện
author_sort Lê Thị Minh Nguyện
collection DOAJ
description The development of the Internet has increased the need for daily online information storage. Finding the correct information that we are interested in takes a lot of time, so the use of techniques for organizing and processing text data are needed. These techniques are called text classification or text categorization. There are many methods of text classification, but for this paper we study and apply the Support Vector Machine (SVM) method and compare its effect with the Naïve Bayes probability method. In addition, before implementing text classification, we performed preprocessing steps on the training set by extracting keywords with dimensional reduction techniques to reduce the time needed in the classification process.
format Article
id doaj-art-0f8d32706b034fe3943321da712a4bb8
institution Kabale University
issn 0866-787X
0866-787X
language English
publishDate 2019-06-01
publisher Dalat University
record_format Article
series Tạp chí Khoa học Đại học Đà Lạt
spelling doaj-art-0f8d32706b034fe3943321da712a4bb82025-02-02T09:26:22ZengDalat UniversityTạp chí Khoa học Đại học Đà Lạt0866-787X0866-787X2019-06-019231910.37569/DalatUniversity.9.2.536(2019)275TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINELê Thị Minh Nguyện0The Faculty of Information Technology, Hochiminh City University of Foreign Languages - Information TechnologyThe development of the Internet has increased the need for daily online information storage. Finding the correct information that we are interested in takes a lot of time, so the use of techniques for organizing and processing text data are needed. These techniques are called text classification or text categorization. There are many methods of text classification, but for this paper we study and apply the Support Vector Machine (SVM) method and compare its effect with the Naïve Bayes probability method. In addition, before implementing text classification, we performed preprocessing steps on the training set by extracting keywords with dimensional reduction techniques to reduce the time needed in the classification process.http://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536feature vectorkernalnaïve bayessupport vector machinetext classification.
spellingShingle Lê Thị Minh Nguyện
TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE
Tạp chí Khoa học Đại học Đà Lạt
feature vector
kernal
naïve bayes
support vector machine
text classification.
title TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE
title_full TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE
title_fullStr TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE
title_full_unstemmed TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE
title_short TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE
title_sort text classification based on support vector machine
topic feature vector
kernal
naïve bayes
support vector machine
text classification.
url http://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536
work_keys_str_mv AT lethiminhnguyen textclassificationbasedonsupportvectormachine