Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews

This paper presents an exploration of sentiment analysis applied to Vietnamese phone reviews, leveraging the PhoBERT model. While significant advancements have been made in sentiment analysis for English and other widely spoken languages, Vietnamese remains relatively under investigated. Our study...

Full description

Saved in:
Bibliographic Details
Main Authors: Tan Minh Ngo, Ba Hung Ngo, Stuchilin Vladimir Valerievich
Format: Article
Language:English
Published: Can Tho University Publisher 2024-10-01
Series:CTU Journal of Innovation and Sustainable Development
Subjects:
Online Access:http://web2010.thanhtoan/index.php/ctujs/article/view/1146
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849434545179328512
author Tan Minh Ngo
Ba Hung Ngo
Stuchilin Vladimir Valerievich
author_facet Tan Minh Ngo
Ba Hung Ngo
Stuchilin Vladimir Valerievich
author_sort Tan Minh Ngo
collection DOAJ
description This paper presents an exploration of sentiment analysis applied to Vietnamese phone reviews, leveraging the PhoBERT model. While significant advancements have been made in sentiment analysis for English and other widely spoken languages, Vietnamese remains relatively under investigated. Our study addresses this gap by constructing a comprehensive dataset that integrates data from the UIT-ViSFD dataset and data collected through web scraping. We experimented with various models including naive Bayes, Support Vector Machine, and PhoBERT, utilizing multiple data preprocessing techniques. PhoBERT, a state-of-the-art pre-trained language model specifically designed for Vietnamese, demonstrated superior performance. The final PhoBERT model with optimized preprocessing achieved an accuracy of 92.74%, highlighting its efficacy in accurately identifying sentiments.
format Article
id doaj-art-8b4d9d6ac952451291c43e17406eaa7f
institution Kabale University
issn 2588-1418
2815-6412
language English
publishDate 2024-10-01
publisher Can Tho University Publisher
record_format Article
series CTU Journal of Innovation and Sustainable Development
spelling doaj-art-8b4d9d6ac952451291c43e17406eaa7f2025-08-20T03:26:35ZengCan Tho University PublisherCTU Journal of Innovation and Sustainable Development2588-14182815-64122024-10-0116Special issue: ISDSFine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviewsTan Minh Ngo0Ba Hung NgoStuchilin Vladimir Valerievich1Can Tho University, Vietnam & National University of Science and Technology, RusiaNational University of Science and Technology MISIS, Moscow, Russia This paper presents an exploration of sentiment analysis applied to Vietnamese phone reviews, leveraging the PhoBERT model. While significant advancements have been made in sentiment analysis for English and other widely spoken languages, Vietnamese remains relatively under investigated. Our study addresses this gap by constructing a comprehensive dataset that integrates data from the UIT-ViSFD dataset and data collected through web scraping. We experimented with various models including naive Bayes, Support Vector Machine, and PhoBERT, utilizing multiple data preprocessing techniques. PhoBERT, a state-of-the-art pre-trained language model specifically designed for Vietnamese, demonstrated superior performance. The final PhoBERT model with optimized preprocessing achieved an accuracy of 92.74%, highlighting its efficacy in accurately identifying sentiments. http://web2010.thanhtoan/index.php/ctujs/article/view/1146Fine-tuned PhoBERT, natural language processing, sentiment analysis, text classification, Vietnamese language
spellingShingle Tan Minh Ngo
Ba Hung Ngo
Stuchilin Vladimir Valerievich
Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews
CTU Journal of Innovation and Sustainable Development
Fine-tuned PhoBERT, natural language processing, sentiment analysis, text classification, Vietnamese language
title Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews
title_full Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews
title_fullStr Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews
title_full_unstemmed Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews
title_short Fine-tuned PhoBERT for sentiment analysis of Vietnamese phone reviews
title_sort fine tuned phobert for sentiment analysis of vietnamese phone reviews
topic Fine-tuned PhoBERT, natural language processing, sentiment analysis, text classification, Vietnamese language
url http://web2010.thanhtoan/index.php/ctujs/article/view/1146
work_keys_str_mv AT tanminhngo finetunedphobertforsentimentanalysisofvietnamesephonereviews
AT bahungngo finetunedphobertforsentimentanalysisofvietnamesephonereviews
AT stuchilinvladimirvalerievich finetunedphobertforsentimentanalysisofvietnamesephonereviews