Knowledge-based Word Tokenization System for Urdu
Word tokenization, a foundational step in natural language processing (NLP), is critical for tasks like part-of-speech tagging, named entity recognition, and parsing, as well as various independent NLP applications. In our tech-driven era, the exponential growth of textual data on the World Wide Web...
| Main Authors: | Asif Khan, Khairullah Khan, Wahab Khan, Sadiq Nawaz Khan, Rafiul Haq |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | MMU Press, 2024-06-01 |
| Series: | Journal of Informatics and Web Engineering |
| Subjects: | |
| Online Access: | https://journals.mmupress.com/index.php/jiwe/article/view/902 |
Similar Items
- Paraphrase detection for Urdu language text using fine-tune BiLSTM framework
  by: Muhammad Ali Aslam, et al.
  Published: (2025-05-01)
- A Computational Approach to Understanding Agglutinative Structures in Urdu
  by: Muhammad Shoaib Tahir, et al.
  Published: (2024-09-01)
- Automatic grammatical tagger for a Spanish–Mixtec parallel corpus
  by: Hermilo Santiago-Benito, et al.
  Published: (2025-02-01)
- UrduSER: A comprehensive dataset for speech emotion recognition in Urdu language (Mendeley Data)
  by: Muhammad Zaheer Akhtar, et al.
  Published: (2025-06-01)
- Selected Literary and Resourceful Websites of Urdu: A Survey
  by: Muhammad Mohsin Khan, et al.
  Published: (2021-12-01)