Speech Emotion Recognition Using Multi-Scale Global–Local Representation Learning with Feature Pyramid Network

Speech Emotion Recognition Using Multi-Scale Global–Local Representation Learning with Feature Pyramid Network

Speech emotion recognition (SER) is important in facilitating natural human–computer interactions. In speech sequence modeling, a vital challenge is to learn context-aware sentence expression and temporal dynamics of paralinguistic features to achieve unambiguous emotional semantic understanding. In...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yuhua Wang, Jianxing Huang, Zhengdao Zhao, Haiyan Lan, Xinjia Zhang
Format:	Article
Language:	English
Published:	MDPI AG 2024-12-01
Series:	Applied Sciences
Subjects:	speech emotion recognition multi-scale feature pyramid network convolutional self-attention
Online Access:	https://www.mdpi.com/2076-3417/14/24/11494
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Enhancing Emotion Recognition in Speech Based on Self-Supervised Learning: Cross-Attention Fusion of Acoustic and Semantic Features
by: Bashar M. Deeb, et al.
Published: (2025-01-01)

A Lightweight Multi-Scale Model for Speech Emotion Recognition
by: Haoming Li, et al.
Published: (2024-01-01)

Speech Emotion Recognition Based on Sparse Representation
by: Jingjie YAN, et al.
Published: (2013-12-01)

Hybrid LSTM–Attention and CNN Model for Enhanced Speech Emotion Recognition
by: Fazliddin Makhmudov, et al.
Published: (2024-12-01)

Optimizing Speech Emotion Recognition with Hilbert Curve and convolutional neural network
by: Zijun Yang, et al.
Published: (2024-01-01)

Speech Emotion Recognition: Humans vs Machines
by: S. Werner, et al.
Published: (2019-12-01)

Enhancing Embedded Space with Low–Level Features for Speech Emotion Recognition
by: Lukasz Smietanka, et al.
Published: (2025-02-01)

A multi-dilated convolution network for speech emotion recognition
by: Samaneh Madanian, et al.
Published: (2025-03-01)

Speech Emotion Recognition Using a Multi-Time-Scale Approach to Feature Aggregation and an Ensemble of SVM Classifiers
by: Antonina STEFANOWSKA, et al.
Published: (2024-03-01)

Multi-branch feature learning based speech emotion recognition using SCAR-NET
by: Keji Mao, et al.
Published: (2023-12-01)

MSDSANet: Multimodal Emotion Recognition Based on Multi-Stream Network and Dual-Scale Attention Network Feature Representation
by: Weitong Sun, et al.
Published: (2025-03-01)

A Lightweight Forward–Backward Independent Temporal-Aware Causal Network for Speech Emotion Recognition
by: Sijia Fei, et al.
Published: (2025-01-01)

Speech Emotion Recognition under White Noise
by: Chengwei HUANG, et al.
Published: (2013-12-01)

Speech Emotion Recognition on MELD and RAVDESS Datasets Using CNN
by: Gheed T. Waleed, et al.
Published: (2025-06-01)

Feature pyramid attention network for audio‐visual scene classification
by: Liguang Zhou, et al.
Published: (2025-04-01)

Attention-Based Multi-Learning Approach for Speech Emotion Recognition With Dilated Convolution
by: Samuel, Kakuba, et al.
Published: (2023)

On Image Ｒecognition Using Bidirectional Feature Pyramid and Deep Neural Network
by: ZHAO Sheng, et al.
Published: (2021-04-01)

Speech Emotion Recognition Using Two-Stage Multiple Instance Learning Networks
by: ZHANG Shiqing, CHEN Chen, ZHAO Xiaoming
Published: (2024-12-01)

Speech emotion recognition algorithm of intelligent robot based on ACO-SVM
by: Xueliang Kang
Published: (2025-12-01)

Preprocessing signal for Speech Emotion Recognition
by: Bashar M. Nema, et al.
Published: (2018-07-01)

Knowledge enhancement for speech emotion recognition via multi-level acoustic feature
by: Huan Zhao, et al.
Published: (2024-12-01)

CAs-Net: A Channel-Aware Speech Network for Uyghur Speech Recognition
by: Jiang Zhang, et al.
Published: (2025-06-01)

Deep Learning-Based Speech Emotion Recognition Using Multi-Level Fusion of Concurrent Features
by: Samuel, Kakuba, et al.
Published: (2023)

Multimodal Emotion Recognition Based on Facial Expressions, Speech, and EEG
by: Jiahui Pan, et al.
Published: (2024-01-01)

Emotion Recognition from Speech in a Subject-Independent Approach
by: Andrzej Majkowski, et al.
Published: (2025-06-01)

Exploration of Complementary Features for Speech Emotion Recognition Based on Kernel Extreme Learning Machine
by: Lili Guo, et al.
Published: (2019-01-01)

Speech Emotion Recognition Based on Voice Fundamental Frequency
by: Teodora DIMITROVA-GREKOW, et al.
Published: (2019-04-01)

Deep learning techniques for speech emotion recognition: A review
by: Silviana Widya Lestari, et al.
Published: (2023-06-01)

A Comprehensive Analysis of Data Augmentation Methods for Speech Emotion Recognition
by: Umut Avci
Published: (2025-01-01)

Speech Emotion Recognition: Comparative Analysis of CNN-LSTM and Attention-Enhanced CNN-LSTM Models
by: Jamsher Bhanbhro, et al.
Published: (2025-05-01)

End-to-end feature fusion for jointly optimized speech enhancement and automatic speech recognition
by: Mohamed Medani, et al.
Published: (2025-07-01)

Domain Adapting Deep Reinforcement Learning for Real-World Speech Emotion Recognition
by: Thejan Rajapakshe, et al.
Published: (2024-01-01)

Polish Speech and Text Emotion Recognition in a Multimodal Emotion Analysis System
by: Kamil Skowroński, et al.
Published: (2024-11-01)

A human pose estimation network based on YOLOv8 framework with efficient multi-scale receptive field and expanded feature pyramid network
by: Shaobin Cai, et al.
Published: (2025-05-01)

MemoCMT: multimodal emotion recognition using cross-modal transformer-based feature fusion
by: Mustaqeem Khan, et al.
Published: (2025-02-01)

Speech emotion recognition with light weight deep neural ensemble model using hand crafted features
by: Jaher Hassan Chowdhury, et al.
Published: (2025-04-01)

Speech emotion recognition using long-term average spectrum
by: Huerta-Hernández Luis David, et al.
Published: (2025-04-01)

Speech emotion recognition based on a stacked autoencoders optimized by PSO based grass fibrous root optimization
by: Chi Zeng, et al.
Published: (2025-07-01)

Artificial Neural Network vs. Support Vector Machine For Speech Emotion Recognition
by: Mohamed. A. Ahmad
Published: (2023-02-01)

Addressing data scarcity in speech emotion recognition: A comprehensive review
by: Samuel Kakuba, et al.
Published: (2025-02-01)