An 81-million-word multi-genre corpus of Arabic booksSwedish National Data Serivice
This article describes The Arabic E-Book Corpus, a freely available Arabic corpus consisting of 1,745 books (81,5 million words) published by the Hindawi Foundation between 2008 and 2024. The books are of various genres, including fiction and non-fiction, children's literature, plays, and poetr...
Saved in:
| Main Author: | Andreas Hallberg |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2025-06-01
|
| Series: | Data in Brief |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S235234092500188X |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Research of Axiological Dominants in Press Release Genre based on Automatic Extraction of Key Words from Corpus
by: L. A. Kochetova, et al.
Published: (2019-06-01) -
L’organisation des expressions ritualisées dans le genre épistolaire : étude contrastive sur corpus en français et en chinois
by: Qianyun Li, et al.
Published: (2023-12-01) -
CORPUS-BASED ANALYSIS OF TRANSITION WORDS
by: Satyawati Surya, M.Pd.
Published: (2023-11-01) -
PROCESSING TOOLS FOR CORPUS LINGUISTICS: A CASE STUDY ON ARABIC HISTORICAL CORPUS
by: Bassam Hasan Hammo, et al.
Published: (2024-04-01) -
Tracing the scope of fear in corpus: similarities and differences in cross-domain/genre texts
by: Ignacio Rodríguez Sánchez, et al.
Published: (2024-12-01)