T_SRNET: A multimodal model based on convolutional neural network for emotional speech enhancement

Speech classification is a technology that can determine the emotional state conveyed by speech. It can support emotion-related applications and improve the human–computer interaction experience. However, the lack of high-quality speech annotation datasets makes it difficult for many models to provi...

Full description

Saved in:
Bibliographic Details
Main Authors: Shaoqiang Wang, Lei Feng, Li Zhang
Format: Article
Language:English
Published: Elsevier 2025-06-01
Series:Alexandria Engineering Journal
Subjects:
Online Access:http://www.sciencedirect.com/science/article/pii/S1110016825003795
Tags: Add Tag
No Tags, Be the first to tag this record!