TMFN: a text-based multimodal fusion network with multi-scale feature extraction and unsupervised contrastive learning for multimodal sentiment analysis

Abstract: Multimodal sentiment analysis (MSA) is crucial in human-computer interaction. Current methods use simple sub-models for feature extraction, neglecting multi-scale features and the complexity of emotions. Text, visual, and audio each have unique characteristics in MSA, with text often provid...

Bibliographic Details
Main Authors:	Junsong Fu, Youjia Fu, Huixia Xue, Zihao Xu
Format:	Article
Language:	English
Published:	Springer 2025-01-01
Series:	Complex & Intelligent Systems
Subjects:	Multimodal sentiment analysis; Multi-scale feature extraction; Multimodal data fusion; Transformer; Unsupervised contrastive learning
Online Access:	https://doi.org/10.1007/s40747-024-01724-5

Similar Items