PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
This paper explores the development, design, and reconstruction of a Historical Arabic Corpus (HAC), which covers more than 1600 years of uninterrupted language use. The study emphasizes the technical aspects followed to enhance the system and provide a usable concordancer, along with simple experim...
Saved in:
| Main Authors: | , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)
2024-04-01
|
| Series: | Jordanian Journal of Computers and Information Technology |
| Subjects: | |
| Online Access: | https://www.jjcit.org/?mno=199918 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1846148472453988352 |
|---|---|
| author | Bassam Hasan Hammo Sane Yagi |
| author_facet | Bassam Hasan Hammo Sane Yagi |
| author_sort | Bassam Hasan Hammo |
| collection | DOAJ |
| description | This paper explores the development, design, and reconstruction of a Historical Arabic Corpus (HAC), which covers more than 1600 years of uninterrupted language use. The study emphasizes the technical aspects followed to enhance the system and provide a usable concordancer, along with simple experiments conducted on the corpus and the concordancer. Arabic has a rich literary and cultural heritage spanning thousands of years. The inclusion of digital resources and the advancement in natural language processing (NLP) technology have made Arabic historical corpora increasingly crucial for researchers and learners worldwide. By integrating HAC and its tools into Arabic language learning, learners can delve deeper into vocabulary and culture and gain valuable insights that improve their language skills and understanding of Arabic. This combination of human guidance and NLP technology makes learning an engaging and enjoyable experience, offering a dynamic and authentic way to master the Arabic language. [JJCIT 2024; 10(4.000): 393-411] |
| format | Article |
| id | doaj-art-8b941db2853a48afad287044f12d9abb |
| institution | Kabale University |
| issn | 2413-9351 2415-1076 |
| language | English |
| publishDate | 2024-04-01 |
| publisher | Scientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT) |
| record_format | Article |
| series | Jordanian Journal of Computers and Information Technology |
| spelling | doaj-art-8b941db2853a48afad287044f12d9abb2024-12-01T05:11:16ZengScientific Research Support Fund of Jordan (SRSF) and Princess Sumaya University for Technology (PSUT)Jordanian Journal of Computers and Information Technology2413-93512415-10762024-04-0110439341110.5455/jjcit.71-1714507767199918PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUSBassam Hasan Hammo0Sane Yagi1Princess Sumaya University for Technology University of Sharjah, United Arab EmiratesThis paper explores the development, design, and reconstruction of a Historical Arabic Corpus (HAC), which covers more than 1600 years of uninterrupted language use. The study emphasizes the technical aspects followed to enhance the system and provide a usable concordancer, along with simple experiments conducted on the corpus and the concordancer. Arabic has a rich literary and cultural heritage spanning thousands of years. The inclusion of digital resources and the advancement in natural language processing (NLP) technology have made Arabic historical corpora increasingly crucial for researchers and learners worldwide. By integrating HAC and its tools into Arabic language learning, learners can delve deeper into vocabulary and culture and gain valuable insights that improve their language skills and understanding of Arabic. This combination of human guidance and NLP technology makes learning an engaging and enjoyable experience, offering a dynamic and authentic way to master the Arabic language. [JJCIT 2024; 10(4.000): 393-411]https://www.jjcit.org/?mno=199918historical arabic corpuscorpus toolsconcordancelearning arabicdata normalizationsemantic shifting |
| spellingShingle | Bassam Hasan Hammo Sane Yagi PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS Jordanian Journal of Computers and Information Technology historical arabic corpus corpus tools concordance learning arabic data normalization semantic shifting |
| title | PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS |
| title_full | PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS |
| title_fullStr | PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS |
| title_full_unstemmed | PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS |
| title_short | PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS |
| title_sort | processing tools for corpus linguistics a case study on arabic historical corpus |
| topic | historical arabic corpus corpus tools concordance learning arabic data normalization semantic shifting |
| url | https://www.jjcit.org/?mno=199918 |
| work_keys_str_mv | AT bassamhasanhammo processingtoolsforcorpuslinguisticsacasestudyonarabichistoricalcorpus AT saneyagi processingtoolsforcorpuslinguisticsacasestudyonarabichistoricalcorpus |