Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation Projection

Semantic role labeling involves assigning semantic roles to sentence arguments, providing rich information for various NLP tasks and applications. Annotated corpora with semantic roles are a critical factor in improving the performance of semantic-based models. Besides, Arabic as a low resourced lan...

Full description

Saved in:
Bibliographic Details
Main Authors: Ferial Senator, Abdelaziz Lakhfif, Imene Zenbout, Hanane Boutouta, Chahrazed Mediani
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10820541/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1841550793300246528
author Ferial Senator
Abdelaziz Lakhfif
Imene Zenbout
Hanane Boutouta
Chahrazed Mediani
author_facet Ferial Senator
Abdelaziz Lakhfif
Imene Zenbout
Hanane Boutouta
Chahrazed Mediani
author_sort Ferial Senator
collection DOAJ
description Semantic role labeling involves assigning semantic roles to sentence arguments, providing rich information for various NLP tasks and applications. Annotated corpora with semantic roles are a critical factor in improving the performance of semantic-based models. Besides, Arabic as a low resourced language, have to pay more attention to alternative methods to build such annotated corpora. To this end, two traditional methods have been intensively used, namely, manual annotation and crowed-resourced annotation. The former is highly precise but it demands substantial training and extensive resources, while, the latter, reduce human effort but often results in lower-quality annotations. Recently, Large language model (LLM) based conversational systems like ChatGPT have emerged as a promising tool for text annotation across various NLP tasks. In this paper, we leverage ChatGPT for two main sub-tasks in Arabic language processing. <xref ref-type="disp-formula" rid="deqn1">(1)</xref> Creating an Arabic annotated resource with emotional semantic roles from an English corpus, using cross-lingual annotation projection approach. <xref ref-type="disp-formula" rid="deqn2">(2)</xref> Annotating the Arabic corpus of emotional sentences with emotion categories and semantic roles. Furthermore, we evaluate ChatGPT&#x2019;s potential for translating English sentences into Arabic. From the perspective of generalization, we test the performance of open-LLMs, specifically, mBERT, and mBART for the same tasks. The evaluation process includes assessing the impact of sentence complexity on the performance of ChatGPT, and open-LLMs in semantic role labeling, and cross-lingual annotation projection. We compared the obtained zero-shot annotation accuracy with that of human base annotations, where the GPT results achieved an accuracy of 0.94 for cross-lingual projection and 0.76 in semantic role labelling, While the open-LLMs achieved notable accuracies of 0.72, and 0.38 respectively.
format Article
id doaj-art-b5c84c2499e84dcca79c236c0b363d1e
institution Kabale University
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-b5c84c2499e84dcca79c236c0b363d1e2025-01-10T00:01:47ZengIEEEIEEE Access2169-35362025-01-01133707372510.1109/ACCESS.2025.352549310820541Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation ProjectionFerial Senator0https://orcid.org/0009-0004-3024-779XAbdelaziz Lakhfif1Imene Zenbout2https://orcid.org/0000-0002-5168-2449Hanane Boutouta3https://orcid.org/0009-0001-5255-0851Chahrazed Mediani4https://orcid.org/0009-0004-7222-393XDepartment of Computer Science, Faculty of Sciences, Setif 1 University-Ferhat ABBAS, Setif, AlgeriaDepartment of Computer Science, Faculty of Sciences, Setif 1 University-Ferhat ABBAS, Setif, AlgeriaDepartment of Computer Science, Faculty of Sciences, Setif 1 University-Ferhat ABBAS, Setif, AlgeriaDepartment of Computer Science, Faculty of Sciences, Setif 1 University-Ferhat ABBAS, Setif, AlgeriaDepartment of Computer Science, Faculty of Sciences, Setif 1 University-Ferhat ABBAS, Setif, AlgeriaSemantic role labeling involves assigning semantic roles to sentence arguments, providing rich information for various NLP tasks and applications. Annotated corpora with semantic roles are a critical factor in improving the performance of semantic-based models. Besides, Arabic as a low resourced language, have to pay more attention to alternative methods to build such annotated corpora. To this end, two traditional methods have been intensively used, namely, manual annotation and crowed-resourced annotation. The former is highly precise but it demands substantial training and extensive resources, while, the latter, reduce human effort but often results in lower-quality annotations. Recently, Large language model (LLM) based conversational systems like ChatGPT have emerged as a promising tool for text annotation across various NLP tasks. In this paper, we leverage ChatGPT for two main sub-tasks in Arabic language processing. <xref ref-type="disp-formula" rid="deqn1">(1)</xref> Creating an Arabic annotated resource with emotional semantic roles from an English corpus, using cross-lingual annotation projection approach. <xref ref-type="disp-formula" rid="deqn2">(2)</xref> Annotating the Arabic corpus of emotional sentences with emotion categories and semantic roles. Furthermore, we evaluate ChatGPT&#x2019;s potential for translating English sentences into Arabic. From the perspective of generalization, we test the performance of open-LLMs, specifically, mBERT, and mBART for the same tasks. The evaluation process includes assessing the impact of sentence complexity on the performance of ChatGPT, and open-LLMs in semantic role labeling, and cross-lingual annotation projection. We compared the obtained zero-shot annotation accuracy with that of human base annotations, where the GPT results achieved an accuracy of 0.94 for cross-lingual projection and 0.76 in semantic role labelling, While the open-LLMs achieved notable accuracies of 0.72, and 0.38 respectively.https://ieeexplore.ieee.org/document/10820541/Semantic role labelingcross-lingual annotation projectionemotion analysisChatGPTArabic language
spellingShingle Ferial Senator
Abdelaziz Lakhfif
Imene Zenbout
Hanane Boutouta
Chahrazed Mediani
Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation Projection
IEEE Access
Semantic role labeling
cross-lingual annotation projection
emotion analysis
ChatGPT
Arabic language
title Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation Projection
title_full Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation Projection
title_fullStr Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation Projection
title_full_unstemmed Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation Projection
title_short Leveraging ChatGPT for Enhancing Arabic NLP: Application for Semantic Role Labeling and Cross-Lingual Annotation Projection
title_sort leveraging chatgpt for enhancing arabic nlp application for semantic role labeling and cross lingual annotation projection
topic Semantic role labeling
cross-lingual annotation projection
emotion analysis
ChatGPT
Arabic language
url https://ieeexplore.ieee.org/document/10820541/
work_keys_str_mv AT ferialsenator leveragingchatgptforenhancingarabicnlpapplicationforsemanticrolelabelingandcrosslingualannotationprojection
AT abdelazizlakhfif leveragingchatgptforenhancingarabicnlpapplicationforsemanticrolelabelingandcrosslingualannotationprojection
AT imenezenbout leveragingchatgptforenhancingarabicnlpapplicationforsemanticrolelabelingandcrosslingualannotationprojection
AT hananeboutouta leveragingchatgptforenhancingarabicnlpapplicationforsemanticrolelabelingandcrosslingualannotationprojection
AT chahrazedmediani leveragingchatgptforenhancingarabicnlpapplicationforsemanticrolelabelingandcrosslingualannotationprojection