Neither Corpus Nor Edition: Building a Pipeline to Make Data Analysis Possible on Medieval Arabic Commentary Traditions
We have built a suite of tools in Python to proficiently analyze text reuse and intertextuality for a specific kind of set of medieval Arabic texts (commentaries) available in print. We take these printed editions, scan them, pre-process the images, give it to an OCR engine, clean the results, and s...
Saved in:
| Main Authors: | Cornelis van Lit, Dirk Roorda |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Department of Languages, Literatures, and Cultures at McGill University
2024-06-01
|
| Series: | Journal of Cultural Analytics |
| Online Access: | https://doi.org/10.22148/001c.116372 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Anabaptism is Neither Catholic nor Protestant
by: Constantine PROKHOROV
Published: (2016-06-01) -
Pseudorandom unitaries are neither real nor sparse nor noise-robust
by: Tobias Haug, et al.
Published: (2025-06-01) -
Neither sword nor pen: phallacious impotence
by: Eliana de Souza Ávila
Published: (2005-01-01) -
Revisiting communitarianism: neither liberal nor authoritarian
by: Ömer Faruk Uysal
Published: (2025-07-01) -
“Neither Here, nor There”: Riverscapes in Films on Migration
by: Mirna Šolić
Published: (2025-02-01)