A multi-way parallel named entity annotated corpus for English, Tamil and Sinhala

This paper presents a multi-way parallel English-Tamil-Sinhala corpus annotated with Named Entities (NEs), where Sinhala and Tamil are low-resource languages. Using pre-trained multilingual Language Models (mLMs), we establish new benchmark Named Entity Recognition (NER) results on this dataset for...

Full description

Saved in:
Bibliographic Details
Main Authors: Surangika Ranathunga, Asanka Ranasinghe, Janaka Shamal, Ayodya Dandeniya, Rashmi Galappaththi, Malithi Samaraweera
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Natural Language Processing Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S2949719125000366
Tags: Add Tag
No Tags, Be the first to tag this record!