Speech Emotion Recognition Using Multi-Scale Global–Local Representation Learning with Feature Pyramid Network
Speech emotion recognition (SER) is important in facilitating natural human–computer interactions. In speech sequence modeling, a vital challenge is to learn context-aware sentence expression and temporal dynamics of paralinguistic features to achieve unambiguous emotional semantic understanding. In...
Saved in:
| Main Authors: | Yuhua Wang, Jianxing Huang, Zhengdao Zhao, Haiyan Lan, Xinjia Zhang |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2024-12-01
|
| Series: | Applied Sciences |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2076-3417/14/24/11494 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Enhancing Emotion Recognition in Speech Based on Self-Supervised Learning: Cross-Attention Fusion of Acoustic and Semantic Features
by: Bashar M. Deeb, et al.
Published: (2025-01-01) -
A Lightweight Multi-Scale Model for Speech Emotion Recognition
by: Haoming Li, et al.
Published: (2024-01-01) -
Speech Emotion Recognition Based on Sparse Representation
by: Jingjie YAN, et al.
Published: (2013-12-01) -
Hybrid LSTM–Attention and CNN Model for Enhanced Speech Emotion Recognition
by: Fazliddin Makhmudov, et al.
Published: (2024-12-01) -
Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
by: Zijun Yang, et al.
Published: (2024-01-01)