Speech Emotion Recognition Using Two-Stage Multiple Instance Learning Networks

In the task of speech emotion recognition (SER), each utterance is usually divided into several equal-length segments when processing the speech signals with unequal lengths, and finally emotion classification is obtained based on the average of the prediction results of all divided segments. Howeve...

Full description

Saved in:
Bibliographic Details
Main Author: ZHANG Shiqing, CHEN Chen, ZHAO Xiaoming
Format: Article
Language:zho
Published: Journal of Computer Engineering and Applications Beijing Co., Ltd., Science Press 2024-12-01
Series:Jisuanji kexue yu tansuo
Subjects:
Online Access:http://fcst.ceaj.org/fileup/1673-9418/PDF/2402013.pdf
Tags: Add Tag
No Tags, Be the first to tag this record!