TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE
The development of the Internet has increased the need to store information online every day. Finding the information we are interested in takes a lot of time, so techniques for organizing and processing text data are needed. These techniques are called text classification or...
Main Author: | Lê Thị Minh Nguyện |
---|---|
Format: | Article |
Language: | English |
Published: | Dalat University, 2019-06-01 |
Series: | Tạp chí Khoa học Đại học Đà Lạt |
Subjects: | feature vector; kernel; naïve Bayes; support vector machine; text classification |
Online Access: | http://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536 |
_version_ | 1832572525591134208 |
---|---|
author | Lê Thị Minh Nguyện |
collection | DOAJ |
description | The development of the Internet has increased the need to store information online every day. Finding the information we are interested in takes a lot of time, so techniques for organizing and processing text data are needed. These techniques are called text classification or text categorization. There are many text classification methods; in this paper we study and apply the Support Vector Machine (SVM) method and compare its effectiveness with that of the Naïve Bayes probabilistic method. In addition, before classifying, we preprocess the training set by extracting keywords with dimensionality reduction techniques to reduce the time needed for classification. |
format | Article |
id | doaj-art-0f8d32706b034fe3943321da712a4bb8 |
institution | Kabale University |
issn | 0866-787X |
language | English |
publishDate | 2019-06-01 |
publisher | Dalat University |
record_format | Article |
series | Tạp chí Khoa học Đại học Đà Lạt |
affiliation | The Faculty of Information Technology, Hochiminh City University of Foreign Languages - Information Technology |
doi | 10.37569/DalatUniversity.9.2.536(2019) |
title | TEXT CLASSIFICATION BASED ON SUPPORT VECTOR MACHINE |
topic | feature vector; kernel; naïve Bayes; support vector machine; text classification |
url | http://tckh.dlu.edu.vn/index.php/tckhdhdl/article/view/536 |
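
The abstract describes a three-stage pipeline: reduce the feature space by keeping keywords, then train an SVM and a Naïve Bayes classifier and compare them. The record gives no implementation details, so the following is only a minimal sketch under stated assumptions: keywords are chosen by document frequency, the SVM is a linear one trained with Pegasos-style sub-gradient descent, and the four-document corpus, the function names, and the parameters `k`, `lam`, and `epochs` are all hypothetical illustrations rather than the author's method.

```python
import math
from collections import Counter

def tokenize(text):
    return text.lower().split()

def select_keywords(docs, k):
    """Keep the k terms with the highest document frequency
    (an assumed, simple stand-in for keyword extraction)."""
    df = Counter()
    for doc in docs:
        df.update(set(tokenize(doc)))
    # Ties are broken alphabetically so the result is deterministic.
    return sorted(df, key=lambda t: (-df[t], t))[:k]

def vectorize(doc, vocab):
    """Term-count feature vector restricted to the selected keywords."""
    counts = Counter(tokenize(doc))
    return [counts[t] for t in vocab]

def train_naive_bayes(X, y):
    """Multinomial Naive Bayes with Laplace (add-one) smoothing."""
    model = {}
    n_features = len(X[0])
    for c in sorted(set(y)):
        rows = [x for x, label in zip(X, y) if label == c]
        totals = [sum(col) for col in zip(*rows)]
        denom = sum(totals) + n_features
        log_prior = math.log(len(rows) / len(X))
        log_likelihood = [math.log((t + 1) / denom) for t in totals]
        model[c] = (log_prior, log_likelihood)
    return model

def predict_naive_bayes(model, x):
    def score(c):
        log_prior, log_likelihood = model[c]
        return log_prior + sum(n * lp for n, lp in zip(x, log_likelihood))
    return max(model, key=score)

def train_linear_svm(X, y, lam=0.1, epochs=20):
    """Pegasos-style sub-gradient descent on the regularized hinge loss.
    Labels must be +1 or -1; no bias term is learned."""
    w = [0.0] * len(X[0])
    t = 0
    for _ in range(epochs):
        for x, label in zip(X, y):
            t += 1
            eta = 1.0 / (lam * t)  # decaying step size
            margin = label * sum(wi * xi for wi, xi in zip(w, x))
            # Regularization shrink: 1 - eta * lam equals 1 - 1/t.
            w = [(1.0 - 1.0 / t) * wi for wi in w]
            if margin < 1:  # margin violated: step toward the example
                w = [wi + eta * label * xi for wi, xi in zip(w, x)]
    return w

def predict_linear_svm(w, x):
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) > 0 else -1

# Hypothetical toy corpus: +1 = sports, -1 = technology.
train_docs = [
    "team scores goal in final match",
    "server bug breaks code deploy",
    "team wins match after late goal",
    "code review finds server bug",
]
train_labels = [1, -1, 1, -1]

vocab = select_keywords(train_docs, k=6)
X_train = [vectorize(d, vocab) for d in train_docs]
nb_model = train_naive_bayes(X_train, train_labels)
svm_weights = train_linear_svm(X_train, train_labels)

for doc in ["late goal wins the match", "bug in server code"]:
    x = vectorize(doc, vocab)
    print(doc, "| NB:", predict_naive_bayes(nb_model, x),
          "| SVM:", predict_linear_svm(svm_weights, x))
```

Restricting every document to `vocab` is what shortens training and prediction here: both classifiers only ever see six-dimensional vectors, whatever the size of the raw vocabulary. A real comparison would need a held-out test set and a much larger corpus than this toy one.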