Knowledge enhancement for speech emotion recognition via multi-level acoustic feature
Speech emotion recognition (SER) has become an increasingly attractive machine learning task for domain applications. It aims to improve the discriminative capacity of speech emotion utilising a certain type of features (e.g. MFCC, Spectrograms, Wav2vec2) or multi-type combination features. However,...
Saved in:
| Main Authors: | Huan Zhao, Nianxin Huang, Haijiao Chen |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Taylor & Francis Group
2024-12-01
|
| Series: | Connection Science |
| Subjects: | |
| Online Access: | https://www.tandfonline.com/doi/10.1080/09540091.2024.2312103 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Enhancing Emotion Recognition in Speech Based on Self-Supervised Learning: Cross-Attention Fusion of Acoustic and Semantic Features
by: Bashar M. Deeb, et al.
Published: (2025-01-01) -
Hierarchical Multi-Task Learning Based on Interactive Multi-Head Attention Feature Fusion for Speech Depression Recognition
by: Yujuan Xing, et al.
Published: (2025-01-01) -
A Lightweight Forward–Backward Independent Temporal-Aware Causal Network for Speech Emotion Recognition
by: Sijia Fei, et al.
Published: (2025-01-01) -
MemoCMT: multimodal emotion recognition using cross-modal transformer-based feature fusion
by: Mustaqeem Khan, et al.
Published: (2025-02-01) -
Speech Emotion Recognition Using Multi-Scale Global–Local Representation Learning with Feature Pyramid Network
by: Yuhua Wang, et al.
Published: (2024-12-01)