PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS

This paper explores the development, design, and reconstruction of a Historical Arabic Corpus (HAC), which covers more than 1600 years of uninterrupted language use. The study emphasizes the technical aspects followed to enhance the system and provide a usable concordancer, along with simple experim...

Full description

Saved in:
Bibliographic Details
Main Authors: Bassam Hasan Hammo, Sane Yagi
Format: Article
Language:English
Published: Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT) 2024-04-01
Series:Jordanian Journal of Computers and Information Technology
Subjects:
Online Access:https://www.jjcit.org/?mno=199918
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1846148472453988352
author Bassam Hasan Hammo
Sane Yagi
author_facet Bassam Hasan Hammo
Sane Yagi
author_sort Bassam Hasan Hammo
collection DOAJ
description This paper explores the development, design, and reconstruction of a Historical Arabic Corpus (HAC), which covers more than 1600 years of uninterrupted language use. The study emphasizes the technical aspects followed to enhance the system and provide a usable concordancer, along with simple experiments conducted on the corpus and the concordancer. Arabic has a rich literary and cultural heritage spanning thousands of years. The inclusion of digital resources and the advancement in natural language processing (NLP) technology have made Arabic historical corpora increasingly crucial for researchers and learners worldwide. By integrating HAC and its tools into Arabic language learning, learners can delve deeper into vocabulary and culture and gain valuable insights that improve their language skills and understanding of Arabic. This combination of human guidance and NLP technology makes learning an engaging and enjoyable experience, offering a dynamic and authentic way to master the Arabic language. [JJCIT 2024; 10(4.000): 393-411]
format Article
id doaj-art-8b941db2853a48afad287044f12d9abb
institution Kabale University
issn 2413-9351
2415-1076
language English
publishDate 2024-04-01
publisher Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)
record_format Article
series Jordanian Journal of Computers and Information Technology
spelling doaj-art-8b941db2853a48afad287044f12d9abb2024-12-01T05:11:16ZengScientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)Jordanian Journal of Computers and Information Technology2413-93512415-10762024-04-0110439341110.5455/jjcit.71-1714507767199918PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUSBassam Hasan Hammo0Sane Yagi1Princess Sumaya University for Technology University of Sharjah, United Arab EmiratesThis paper explores the development, design, and reconstruction of a Historical Arabic Corpus (HAC), which covers more than 1600 years of uninterrupted language use. The study emphasizes the technical aspects followed to enhance the system and provide a usable concordancer, along with simple experiments conducted on the corpus and the concordancer. Arabic has a rich literary and cultural heritage spanning thousands of years. The inclusion of digital resources and the advancement in natural language processing (NLP) technology have made Arabic historical corpora increasingly crucial for researchers and learners worldwide. By integrating HAC and its tools into Arabic language learning, learners can delve deeper into vocabulary and culture and gain valuable insights that improve their language skills and understanding of Arabic. This combination of human guidance and NLP technology makes learning an engaging and enjoyable experience, offering a dynamic and authentic way to master the Arabic language. [JJCIT 2024; 10(4.000): 393-411]https://www.jjcit.org/?mno=199918historical arabic corpuscorpus toolsconcordancelearning arabicdata normalizationsemantic shifting
spellingShingle Bassam Hasan Hammo
Sane Yagi
PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
Jordanian Journal of Computers and Information Technology
historical arabic corpus
corpus tools
concordance
learning arabic
data normalization
semantic shifting
title PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
title_full PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
title_fullStr PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
title_full_unstemmed PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
title_short PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
title_sort processing tools for corpus linguistics a case study on arabic historical corpus
topic historical arabic corpus
corpus tools
concordance
learning arabic
data normalization
semantic shifting
url https://www.jjcit.org/?mno=199918
work_keys_str_mv AT bassamhasanhammo processingtoolsforcorpuslinguisticsacasestudyonarabichistoricalcorpus
AT saneyagi processingtoolsforcorpuslinguisticsacasestudyonarabichistoricalcorpus