Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model

Depression is one of the most dangerous mental health conditions, often leading to suicide, which is the fourth leading cause of death in the Middle East. Particularly, Egypt has the highest suicide rate in the region, making it crucial to recognize depression and suicidal thoughts early. In Arab cu...

Full description

Saved in:
Bibliographic Details
Main Authors: Mahmoud M. Selim, Mohammed S. Assiri
Format: Article
Language:English
Published: Elsevier 2025-04-01
Series:Alexandria Engineering Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1110016825001784
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1823861188172185600
author Mahmoud M. Selim
Mohammed S. Assiri
author_facet Mahmoud M. Selim
Mohammed S. Assiri
author_sort Mahmoud M. Selim
collection DOAJ
description Depression is one of the most dangerous mental health conditions, often leading to suicide, which is the fourth leading cause of death in the Middle East. Particularly, Egypt has the highest suicide rate in the region, making it crucial to recognize depression and suicidal thoughts early. In Arab culture, awareness of mental health issues is limited, but in recent years, people have increasingly expressed their feelings on social media platforms. This shift presents an opportunity for mental health intervention through digital means. Furthermore, while facial expressions are not accessible to the blind and visually impaired, voice signals can convey emotional nuances, offering an alternative method for detecting mental health states. Natural Language Processing (NLP) and machine learning (ML) techniques provide powerful tools for analysing social media text data, helping detect emotional distress and providing timely support. By applying these technologies, AI-driven solutions can assist in understanding and addressing mental health concerns more inclusively. This study designs an Arabic Mood Changing and Depression Detection using the Artificial Hummingbird Optimization Algorithm with Deep Learning (AMCDD-AHODL) technique for visually impaired individuals. The AMCDD-AHODL technique detects different kinds of emotions and depression using Arabic tweets. After pre-processing, the word embedding process is carried out using the AraBERT model. Furthermore, the AMCDD-AHODL technique utilizes a hybrid LSTM+BiGRU model for the recognition and classification model. To improve the performance of the hybrid LSTM+BiGRU methodology, the AMCDD-AHODL technique comprises an AHO-based hyperparameter tuning process. Finally, the WaveNet model enhances the naturalness and clarity of text-to-speech synthesis, delivering high-quality, human-like audio output. The AMCDD-AHODL approach is examined using the Modern Standard Arabic dataset containing 1229 records. The performance validation of the AMCDD-AHODL approach portrayed a superior accuracy value of 95.80 % compared to the existing ML and DL models. Therefore, the AMCDD-AHODL technique is applied for the early identification of various kinds of depression that can decrease the distress from the illness and the stigma related to mental health problems.
format Article
id doaj-art-778b271cb3c04d04be90a0f389e3edff
institution Kabale University
issn 1110-0168
language English
publishDate 2025-04-01
publisher Elsevier
record_format Article
series Alexandria Engineering Journal
spelling doaj-art-778b271cb3c04d04be90a0f389e3edff2025-02-10T04:34:14ZengElsevierAlexandria Engineering Journal1110-01682025-04-01119493502Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning modelMahmoud M. Selim0Mohammed S. Assiri1Department of Mathematics, College of Science and Humanities, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia; King Salman Center for Disability Research, Riyadh 11614, Saudi Arabia; Corresponding author at: Department of Mathematics, College of Science and Humanities, Prince Sattam Bin Abdulaziz University, Al-Kharj 11942, Saudi Arabia.Department of Computer Science, College of Computer Engineering and Sciences, Prince Sattam Bin Abdulaziz University, P.O. BOX 16273, Al-Kharj ZIP 3963, Saudi Arabia; King Salman Center for Disability Research, Riyadh 11614, Saudi ArabiaDepression is one of the most dangerous mental health conditions, often leading to suicide, which is the fourth leading cause of death in the Middle East. Particularly, Egypt has the highest suicide rate in the region, making it crucial to recognize depression and suicidal thoughts early. In Arab culture, awareness of mental health issues is limited, but in recent years, people have increasingly expressed their feelings on social media platforms. This shift presents an opportunity for mental health intervention through digital means. Furthermore, while facial expressions are not accessible to the blind and visually impaired, voice signals can convey emotional nuances, offering an alternative method for detecting mental health states. Natural Language Processing (NLP) and machine learning (ML) techniques provide powerful tools for analysing social media text data, helping detect emotional distress and providing timely support. By applying these technologies, AI-driven solutions can assist in understanding and addressing mental health concerns more inclusively. This study designs an Arabic Mood Changing and Depression Detection using the Artificial Hummingbird Optimization Algorithm with Deep Learning (AMCDD-AHODL) technique for visually impaired individuals. The AMCDD-AHODL technique detects different kinds of emotions and depression using Arabic tweets. After pre-processing, the word embedding process is carried out using the AraBERT model. Furthermore, the AMCDD-AHODL technique utilizes a hybrid LSTM+BiGRU model for the recognition and classification model. To improve the performance of the hybrid LSTM+BiGRU methodology, the AMCDD-AHODL technique comprises an AHO-based hyperparameter tuning process. Finally, the WaveNet model enhances the naturalness and clarity of text-to-speech synthesis, delivering high-quality, human-like audio output. The AMCDD-AHODL approach is examined using the Modern Standard Arabic dataset containing 1229 records. The performance validation of the AMCDD-AHODL approach portrayed a superior accuracy value of 95.80 % compared to the existing ML and DL models. Therefore, the AMCDD-AHODL technique is applied for the early identification of various kinds of depression that can decrease the distress from the illness and the stigma related to mental health problems.http://www.sciencedirect.com/science/article/pii/S1110016825001784Arabic languageNatural language processingDepression DetectionArtificial Hummingbird OptimizationVisually Impaired IndividualsText-to-Speech Synthesizer
spellingShingle Mahmoud M. Selim
Mohammed S. Assiri
Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model
Alexandria Engineering Journal
Arabic language
Natural language processing
Depression Detection
Artificial Hummingbird Optimization
Visually Impaired Individuals
Text-to-Speech Synthesizer
title Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model
title_full Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model
title_fullStr Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model
title_full_unstemmed Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model
title_short Enhancing Arabic text-to-speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model
title_sort enhancing arabic text to speech synthesis for emotional expression in visually impaired individuals using the artificial hummingbird and hybrid deep learning model
topic Arabic language
Natural language processing
Depression Detection
Artificial Hummingbird Optimization
Visually Impaired Individuals
Text-to-Speech Synthesizer
url http://www.sciencedirect.com/science/article/pii/S1110016825001784
work_keys_str_mv AT mahmoudmselim enhancingarabictexttospeechsynthesisforemotionalexpressioninvisuallyimpairedindividualsusingtheartificialhummingbirdandhybriddeeplearningmodel
AT mohammedsassiri enhancingarabictexttospeechsynthesisforemotionalexpressioninvisuallyimpairedindividualsusingtheartificialhummingbirdandhybriddeeplearningmodel