Automatic grammatical tagger for a Spanish–Mixtec parallel corpus
In this work, we developed the first intelligent automatic grammatical tagger for a Spanish–Mixtec parallel corpus in Mexico. The proposed tagger consists of multiple phases. We started by collecting a Spanish–Mixtec parallel corpus of 12,300 sentences. Then, we tokenized the corpus at the word leve...
Saved in:
| Main Authors: | Hermilo Santiago-Benito, Diana-Margarita Córdova-Esparza, Noé-Alejandro Castro-Sánchez, Juan Terven, Julio-Alejandro Romero-González, Teresa García-Ramirez |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2025-02-01
|
| Series: | SoftwareX |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2352711024003558 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
An Adaptive Harmony Search Part-of-Speech tagger for Square Hmong Corpus
by: Di-Wen Kang, et al.
Published: (2024-02-01) -
Mixtec–Spanish Parallel Text Dataset for Language Technology Development
by: Hermilo Santiago-Benito, et al.
Published: (2025-06-01) -
Corpus Bootstrapping for Syriac Linguistics
by: Charbel El-Khaissi
Published: (2024-09-01) -
Ear tag and PIT tag retention by white‐tailed deer
by: Emily H. Belser, et al.
Published: (2017-12-01) -
A framework for automatically generating composite keywords for geo-tagged street images
Published: (2025-01-01)