T_SRNET: A multimodal model based on convolutional neural network for emotional speech enhancement

Speech classification is a technology that can determine the emotional state conveyed by speech. It can support emotion-related applications and improve the human–computer interaction experience. However, the lack of high-quality speech annotation datasets makes it difficult for many models to provi...

Full description

Saved in:

Bibliographic Details
Main Authors:	Shaoqiang Wang, Lei Feng, Li Zhang
Format:	Article
Language:	English
Published:	Elsevier 2025-06-01
Series:	Alexandria Engineering Journal
Subjects:	Speech recognition Emotion image Diffusion model Transform structure
Online Access:	http://www.sciencedirect.com/science/article/pii/S1110016825003795
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

http://www.sciencedirect.com/science/article/pii/S1110016825003795

T_SRNET: A multimodal model based on convolutional neural network for emotional speech enhancement

Internet

Similar Items