Speech Emotion Recognition Using Multi-Scale Global–Local Representation Learning with Feature Pyramid Network

Speech emotion recognition (SER) is important in facilitating natural human–computer interactions. In speech sequence modeling, a vital challenge is to learn context-aware sentence expression and temporal dynamics of paralinguistic features to achieve unambiguous emotional semantic understanding. In...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuhua Wang, Jianxing Huang, Zhengdao Zhao, Haiyan Lan, Xinjia Zhang
Format: Article
Language:English
Published: MDPI AG 2024-12-01
Series:Applied Sciences
Subjects:
Online Access:https://www.mdpi.com/2076-3417/14/24/11494
Tags: Add Tag
No Tags, Be the first to tag this record!

Similar Items