An Adaptive Harmony Search Part-of-Speech tagger for Square Hmong Corpus

Data-driven models perform poorly on part-of-speech tagging problems with the square Hmong language, a low-resource corpus. This paper designs a weight evaluation function to reduce the influence of unknown words. It proposes an improved harmony search algorithm utilizing the roulette and local eva...

Bibliographic Details
Main Authors: Di-Wen Kang, Shao-Qiang Ye, Sharifah Zarith Rahmah Syed Ahmad, Li-Ping Mo, Feng Qin, Pan Zhou
Format: Article
Language: English
Published: University of Baghdad, College of Science for Women 2024-02-01
Series: مجلة بغداد للعلوم (Baghdad Science Journal)
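Subjects: Harmony Search Algorithm, Low-resource language, Optimization, Part-of-Speech tagging, Unknown words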
Online Access: https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/9694
author Di-Wen Kang
Shao-Qiang Ye
Sharifah Zarith Rahmah Syed Ahmad
Li-Ping Mo
Feng Qin
Pan Zhou
collection DOAJ
description Data-driven models perform poorly on part-of-speech tagging for the square Hmong language, a low-resource language. This paper designs a weight evaluation function to reduce the influence of unknown words. It proposes an improved harmony search algorithm that uses roulette and local evaluation strategies to handle the square Hmong part-of-speech tagging problem. Experiments show that the average accuracy of the proposed model is 6% and 8% higher than that of the HMM and BiLSTM-CRF models, respectively. The average F1 of the proposed model is likewise 6% and 3% higher than that of the HMM and BiLSTM-CRF models, respectively.
format Article
id doaj-art-47f0f1a61bc24aec8f9cc336a501ca79
institution Kabale University
issn 2078-8665
2411-7986
language English
publishDate 2024-02-01
publisher University of Baghdad, College of Science for Women
record_format Article
series مجلة بغداد للعلوم (Baghdad Science Journal)
spelling doaj-art-47f0f1a61bc24aec8f9cc336a501ca79 2025-08-20T03:34:31Z eng
University of Baghdad, College of Science for Women; مجلة بغداد للعلوم (Baghdad Science Journal); ISSN 2078-8665, 2411-7986
2024-02-01; vol. 21, no. 2(SI); DOI 10.21123/bsj.2024.9694
An Adaptive Harmony Search Part-of-Speech tagger for Square Hmong Corpus
Di-Wen Kang: School of Communication and Electronic Engineering, Jishou University, Jishou, 416000, China.
Shao-Qiang Ye: Faculty of Computing, Universiti Teknologi Malaysia, Johor, 80310, Malaysia & College of Information and Engineering, Hunan Applied Technology University, Changde, Hunan, 415000, China.
Sharifah Zarith Rahmah Syed Ahmad: Faculty of Computing, Universiti Teknologi Malaysia, Johor, 80310, Malaysia.
Li-Ping Mo: College of Computer Science and Engineering, Jishou University, Jishou, Hunan, 416000, China.
Feng Qin: Faculty of Computing, Universiti Teknologi Malaysia, Johor, 80310, Malaysia.
Pan Zhou: School of Communication and Electronic Engineering, Jishou University, Jishou, 416000, China.
Data-driven models perform poorly on part-of-speech tagging for the square Hmong language, a low-resource language. This paper designs a weight evaluation function to reduce the influence of unknown words. It proposes an improved harmony search algorithm that uses roulette and local evaluation strategies to handle the square Hmong part-of-speech tagging problem. Experiments show that the average accuracy of the proposed model is 6% and 8% higher than that of the HMM and BiLSTM-CRF models, respectively. The average F1 of the proposed model is likewise 6% and 3% higher than that of the HMM and BiLSTM-CRF models, respectively.
https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/9694
Harmony Search Algorithm, Low-resource language, Optimization, Part-of-Speech tagging, Unknown words
title An Adaptive Harmony Search Part-of-Speech tagger for Square Hmong Corpus
topic Harmony Search Algorithm, Low-resource language, Optimization, Part-of-Speech tagging, Unknown words
url https://bsj.uobaghdad.edu.iq/index.php/BSJ/article/view/9694