UnifiedCut: A Simple and Efficient Neural Model for Thai, Burmese and Khmer Word Segmentation

Word segmentation is a critical task in natural language processing for southeast Asian Abugida languages, including Thai, Burmese, and Khmer. Existing approaches demonstrate that models using fixed-length windowed context inputs can achieve high segmentation accuracy; however, they often rely on lo...

Full description

Saved in:
Bibliographic Details
Main Authors: Yonghua Wen, Yantuan Xian, Yuehan Wang, Zhengtao Yu
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/23/11435
Tags: Add Tag
No Tags, Be the first to tag this record!