MS-EmoBoost: a novel strategy for enhancing self-supervised speech emotion representations

Abstract Extracting richer emotional representations from raw speech is one of the key approaches to improving the accuracy of Speech Emotion Recognition (SER). In recent years, there has been a trend in utilizing self-supervised learning (SSL) for extracting SER features, due to the exceptional per...

Full description

Saved in:
Bibliographic Details
Main Authors: Hongchen Song, Long Zhang, Meixian Gao, Hengyuan Zhang, Thomas Hain, Linlin Shan
Format: Article
Language:English
Published: Nature Portfolio 2025-07-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-94727-2
Tags: Add Tag
No Tags, Be the first to tag this record!