Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas

Abstract Deep learning has significantly advanced the question-answering (QA) systems across various sectors. However, Arabic-language systems for Hajj-related fatwas (non-binding Islamic legal opinions issued by muftis) remain underdeveloped. This paper introduces Hajj-FQA, a benchmark Arabic datas...

Full description

Saved in:

Bibliographic Details
Main Authors:	Hayfa A. Aleid, Aqil M. Azmi
Format:	Article
Language:	English
Published:	Springer 2025-07-01
Series:	Journal of King Saud University: Computer and Information Sciences
Subjects:	Chatbots Arabic question-answering Arabic NLP Dataset Fatwa Hajj
Online Access:	https://doi.org/10.1007/s44443-025-00128-w
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849332037858623488
author	Hayfa A. Aleid Aqil M. Azmi
author_facet	Hayfa A. Aleid Aqil M. Azmi
author_sort	Hayfa A. Aleid
collection	DOAJ
description	Abstract Deep learning has significantly advanced the question-answering (QA) systems across various sectors. However, Arabic-language systems for Hajj-related fatwas (non-binding Islamic legal opinions issued by muftis) remain underdeveloped. This paper introduces Hajj-FQA, a benchmark Arabic dataset specifically designed to develop HajjBot - a specialized chatbot for fatwas QA during the annual Hajj pilgrimage. The dataset captures the unique linguistic and jurisprudential characteristics of pilgrims’ inquiries, enabling accurate, domain-specific responses. We present a comprehensive quantitative analysis of the dataset’s construction methodology and its distinctive question-answer patterns. Evaluation using multilingual and Arabic-specific language models across three tasks - machine reading comprehension (MRC), duplicate question detection (DQD), and duplicate answer detection (DAD) - with 10-fold cross-validation demonstrates the practical utility of Hajj-FQA. Results show exceptional performance in classification tasks (AraBERTv0.2 achieved a precision score of 99.19% for DQD and 99.26% for DAD) and strong extractive answering capability with an $$F\text {-score}$$ F -score of 72.78%. While generative performance reached $$\text {BERT-}F$$ BERT- F score of 71.4% (AraBART), MRC variability highlights challenges in religious reasoning. These findings establish Hajj-FQA as both: (1) a critical resource for developing specialized fatwa chatbots like HajjBot, and (2) a benchmark for Arabic religious QA systems. The dataset directly addresses the urgent need for accurate, automated fatwa assistance during Hajj, while providing insights for future improvements in Islamic NLP applications.
format	Article
id	doaj-art-68bb5f3cb1fa41c7813e6059dd425860
institution	Kabale University
issn	1319-1578 2213-1248
language	English
publishDate	2025-07-01
publisher	Springer
record_format	Article
series	Journal of King Saud University: Computer and Information Sciences
spelling	doaj-art-68bb5f3cb1fa41c7813e6059dd4258602025-08-20T03:46:20ZengSpringerJournal of King Saud University: Computer and Information Sciences1319-15782213-12482025-07-0137612810.1007/s44443-025-00128-wHajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwasHayfa A. Aleid0Aqil M. Azmi1Department of Computer Science, College of Computer & Information Sciences, King Saud UniversityDepartment of Computer Science, College of Computer & Information Sciences, King Saud UniversityAbstract Deep learning has significantly advanced the question-answering (QA) systems across various sectors. However, Arabic-language systems for Hajj-related fatwas (non-binding Islamic legal opinions issued by muftis) remain underdeveloped. This paper introduces Hajj-FQA, a benchmark Arabic dataset specifically designed to develop HajjBot - a specialized chatbot for fatwas QA during the annual Hajj pilgrimage. The dataset captures the unique linguistic and jurisprudential characteristics of pilgrims’ inquiries, enabling accurate, domain-specific responses. We present a comprehensive quantitative analysis of the dataset’s construction methodology and its distinctive question-answer patterns. Evaluation using multilingual and Arabic-specific language models across three tasks - machine reading comprehension (MRC), duplicate question detection (DQD), and duplicate answer detection (DAD) - with 10-fold cross-validation demonstrates the practical utility of Hajj-FQA. Results show exceptional performance in classification tasks (AraBERTv0.2 achieved a precision score of 99.19% for DQD and 99.26% for DAD) and strong extractive answering capability with an $$F\text {-score}$$ F -score of 72.78%. While generative performance reached $$\text {BERT-}F$$ BERT- F score of 71.4% (AraBART), MRC variability highlights challenges in religious reasoning. These findings establish Hajj-FQA as both: (1) a critical resource for developing specialized fatwa chatbots like HajjBot, and (2) a benchmark for Arabic religious QA systems. The dataset directly addresses the urgent need for accurate, automated fatwa assistance during Hajj, while providing insights for future improvements in Islamic NLP applications.https://doi.org/10.1007/s44443-025-00128-wChatbotsArabic question-answeringArabic NLPDatasetFatwaHajj
spellingShingle	Hayfa A. Aleid Aqil M. Azmi Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas Journal of King Saud University: Computer and Information Sciences Chatbots Arabic question-answering Arabic NLP Dataset Fatwa Hajj
title	Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas
title_full	Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas
title_fullStr	Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas
title_full_unstemmed	Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas
title_short	Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas
title_sort	hajj fqa a benchmark arabic dataset for developing question answering systems on hajj fatwas
topic	Chatbots Arabic question-answering Arabic NLP Dataset Fatwa Hajj
url	https://doi.org/10.1007/s44443-025-00128-w
work_keys_str_mv	AT hayfaaaleid hajjfqaabenchmarkarabicdatasetfordevelopingquestionansweringsystemsonhajjfatwas AT aqilmazmi hajjfqaabenchmarkarabicdatasetfordevelopingquestionansweringsystemsonhajjfatwas

Hajj-FQA: A benchmark Arabic dataset for developing question-answering systems on Hajj fatwas

Similar Items