Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System

Speech emotion recognition (SER) is a challenging task due to the complex and subtle nature of emotions. This study proposes a novel approach for emotion modeling using speech signals by combining discrete wavelet transform (DWT) with linear prediction coding (LPC). The performance of various classi...

Full description

Saved in:

Bibliographic Details
Main Authors:	K. Daqrouq, A. Balamesh, O. Alrusaini, A. Alkhateeb, A. S. Balamash
Format:	Article
Language:	English
Published:	Wiley 2024-01-01
Series:	Applied Computational Intelligence and Soft Computing
Online Access:	http://dx.doi.org/10.1155/2024/7184018
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849744389534908416
author	K. Daqrouq A. Balamesh O. Alrusaini A. Alkhateeb A. S. Balamash
author_facet	K. Daqrouq A. Balamesh O. Alrusaini A. Alkhateeb A. S. Balamash
author_sort	K. Daqrouq
collection	DOAJ
description	Speech emotion recognition (SER) is a challenging task due to the complex and subtle nature of emotions. This study proposes a novel approach for emotion modeling using speech signals by combining discrete wavelet transform (DWT) with linear prediction coding (LPC). The performance of various classifiers, including support vector machine (SVM), K-Nearest Neighbors (KNN), Efficient Logistic Regression, Naive Bayes, Ensemble, and Neural Network, was evaluated for emotion classification using the EMO-DB dataset. Evaluation metrics such as area under the curve (AUC), average prediction accuracy, and cross-validation techniques were employed. The results indicate that KNN and SVM classifiers exhibited high accuracy in distinguishing sadness from other emotions. Ensemble methods and Neural Networks also demonstrated strong performance in sadness classification. While Efficient Logistic Regression and Naive Bayes classifiers showed competitive performance, they were slightly less accurate compared to other classifiers. Furthermore, the proposed feature extraction method yielded the highest average accuracy, and its combination with formants or wavelet entropy further improved classification accuracy. On the other hand, Efficient Logistic Regression exhibited the lowest accuracies among the classifiers. The uniqueness of this study was that it investigated a combined feature extraction method and integrated them to compare with various forms of combinations. However, the purposes of the investigation include improved performance of the classifiers, high effectiveness of the system, and the potential for emotion classification tasks. These findings can guide the selection of appropriate classifiers and feature extraction methods in future research and real-world applications. Further investigations can focus on refining classifiers and exploring additional feature extraction techniques to enhance emotion classification accuracy.
format	Article
id	doaj-art-a3669dc6cb844a4b907a0e62881cdfa6
institution	DOAJ
issn	1687-9732
language	English
publishDate	2024-01-01
publisher	Wiley
record_format	Article
series	Applied Computational Intelligence and Soft Computing
spelling	doaj-art-a3669dc6cb844a4b907a0e62881cdfa62025-08-20T03:19:57ZengWileyApplied Computational Intelligence and Soft Computing1687-97322024-01-01202410.1155/2024/7184018Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition SystemK. Daqrouq0A. Balamesh1O. Alrusaini2A. Alkhateeb3A. S. Balamash4Department of Electrical and Computer EngineeringDepartment of Electrical and Computer EngineeringDepartment of Engineering and Applied SciencesDepartment of Electrical and Computer EngineeringDepartment of Electrical and Computer EngineeringSpeech emotion recognition (SER) is a challenging task due to the complex and subtle nature of emotions. This study proposes a novel approach for emotion modeling using speech signals by combining discrete wavelet transform (DWT) with linear prediction coding (LPC). The performance of various classifiers, including support vector machine (SVM), K-Nearest Neighbors (KNN), Efficient Logistic Regression, Naive Bayes, Ensemble, and Neural Network, was evaluated for emotion classification using the EMO-DB dataset. Evaluation metrics such as area under the curve (AUC), average prediction accuracy, and cross-validation techniques were employed. The results indicate that KNN and SVM classifiers exhibited high accuracy in distinguishing sadness from other emotions. Ensemble methods and Neural Networks also demonstrated strong performance in sadness classification. While Efficient Logistic Regression and Naive Bayes classifiers showed competitive performance, they were slightly less accurate compared to other classifiers. Furthermore, the proposed feature extraction method yielded the highest average accuracy, and its combination with formants or wavelet entropy further improved classification accuracy. On the other hand, Efficient Logistic Regression exhibited the lowest accuracies among the classifiers. The uniqueness of this study was that it investigated a combined feature extraction method and integrated them to compare with various forms of combinations. However, the purposes of the investigation include improved performance of the classifiers, high effectiveness of the system, and the potential for emotion classification tasks. These findings can guide the selection of appropriate classifiers and feature extraction methods in future research and real-world applications. Further investigations can focus on refining classifiers and exploring additional feature extraction techniques to enhance emotion classification accuracy.http://dx.doi.org/10.1155/2024/7184018
spellingShingle	K. Daqrouq A. Balamesh O. Alrusaini A. Alkhateeb A. S. Balamash Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System Applied Computational Intelligence and Soft Computing
title	Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System
title_full	Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System
title_fullStr	Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System
title_full_unstemmed	Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System
title_short	Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System
title_sort	emotion modeling in speech signals discrete wavelet transform and machine learning tools for emotion recognition system
url	http://dx.doi.org/10.1155/2024/7184018
work_keys_str_mv	AT kdaqrouq emotionmodelinginspeechsignalsdiscretewavelettransformandmachinelearningtoolsforemotionrecognitionsystem AT abalamesh emotionmodelinginspeechsignalsdiscretewavelettransformandmachinelearningtoolsforemotionrecognitionsystem AT oalrusaini emotionmodelinginspeechsignalsdiscretewavelettransformandmachinelearningtoolsforemotionrecognitionsystem AT aalkhateeb emotionmodelinginspeechsignalsdiscretewavelettransformandmachinelearningtoolsforemotionrecognitionsystem AT asbalamash emotionmodelinginspeechsignalsdiscretewavelettransformandmachinelearningtoolsforemotionrecognitionsystem

Emotion Modeling in Speech Signals: Discrete Wavelet Transform and Machine Learning Tools for Emotion Recognition System

Similar Items