MS-EmoBoost: a novel strategy for enhancing self-supervised speech emotion representations
Abstract Extracting richer emotional representations from raw speech is one of the key approaches to improving the accuracy of Speech Emotion Recognition (SER). In recent years, there has been a trend in utilizing self-supervised learning (SSL) for extracting SER features, due to the exceptional per...
Saved in:
| Main Authors: | Hongchen Song, Long Zhang, Meixian Gao, Hengyuan Zhang, Thomas Hain, Linlin Shan |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-07-01
|
| Series: | Scientific Reports |
| Online Access: | https://doi.org/10.1038/s41598-025-94727-2 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
EmoSDS: Unified Emotionally Adaptive Spoken Dialogue System Using Self-Supervised Speech Representations
by: Jaehwan Lee, et al.
Published: (2025-03-01) -
Generic speech enhancement with self-supervised representation space loss
by: Hiroshi Sato, et al.
Published: (2025-07-01) -
Pri-EMO: A universal perturbation method for privacy preserving facial emotion recognition
by: Yong Zeng, et al.
Published: (2023-11-01) -
EmoWELL: effectiveness of a serious game for emotion regulation in emerging adulthood
by: Saray Velert-Jiménez, et al.
Published: (2025-04-01) -
Self-supervised speech representation learning based on positive sample comparison and masking reconstruction
by: Wenlin ZHANG, et al.
Published: (2022-07-01)