An 81-million-word multi-genre corpus of Arabic booksSwedish National Data Serivice

This article describes The Arabic E-Book Corpus, a freely available Arabic corpus consisting of 1,745 books (81,5 million words) published by the Hindawi Foundation between 2008 and 2024. The books are of various genres, including fiction and non-fiction, children's literature, plays, and poetr...

Full description

Saved in:
Bibliographic Details
Main Author: Andreas Hallberg
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Data in Brief
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S235234092500188X
Tags: Add Tag
No Tags, Be the first to tag this record!

Similar Items