Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation

In order to better utilize of the semantic information contained in the part of speech of words and the contextual information of unnatural language accompanying the appearance of words, a part of speech weighted multi-modal sentiment analysis model with dynamic adjustment of semantic representation...

Full description

Saved in:
Bibliographic Details
Main Authors: HUA Qiang, CHEN Zhuo, ZHANG Feng, DONG Chunru
Format: Article
Language:English
Published: Science Press (China Science Publishing & Media Ltd.) 2024-05-01
Series:Shenzhen Daxue xuebao. Ligong ban
Subjects:
Online Access:https://journal.szu.edu.cn/en/#/digest?ArticleID=2616
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849774131522830336
author HUA Qiang
CHEN Zhuo
ZHANG Feng
DONG Chunru
author_facet HUA Qiang
CHEN Zhuo
ZHANG Feng
DONG Chunru
author_sort HUA Qiang
collection DOAJ
description In order to better utilize of the semantic information contained in the part of speech of words and the contextual information of unnatural language accompanying the appearance of words, a part of speech weighted multi-modal sentiment analysis model with dynamic adjustment of semantic representation (PM-DS) is proposed. The PM-DS model takes natural language as the main body, and uses bidirectional encoder representation from transformer model, generalized autoregressive pre-training model for language understanding (XLNet) and a robustly optimized BERT pretraining approach (RoBERTa) to embed words into text patterns, respectively. A dynamic semantic adjustment module is created to effectively combine natural language and unnatural language information. The part of speech weighting module is designed to extract the part of speech of words and assigned weights to optimize sentiment discrimination. Comparative experimental results with the current advanced models such as tensor fusion network and low-rank multimodal fusion show that the average absolute errors of PW-DS model on public data sets CMU-MOSI and CMU-MOSEI are 0.607 and 0.510, respectively, and the binary classification accuracies are 89.02% and 86.93%, respectively, which is better than the models in the comparative experiments. The effects of different modules on the model are also analyzed through ablation experiments. The experimental results demonstrate that the proposed model is effective to deal with the problem of multi-modal emotion analysis.
format Article
id doaj-art-4cc8cd3cf17241858e02c31e4b5b9c1f
institution DOAJ
issn 1000-2618
language English
publishDate 2024-05-01
publisher Science Press (China Science Publishing & Media Ltd.)
record_format Article
series Shenzhen Daxue xuebao. Ligong ban
spelling doaj-art-4cc8cd3cf17241858e02c31e4b5b9c1f2025-08-20T03:01:50ZengScience Press (China Science Publishing & Media Ltd.)Shenzhen Daxue xuebao. Ligong ban1000-26182024-05-0141328329210.3724/SP.J.1249.2024.032831000-2618(2024)03-0283-10Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representationHUA Qiang0CHEN Zhuo1ZHANG Feng2DONG Chunru3Hebei Key Laboratory of Machine Learning and Computational Intelligence, College of Mathematics and Information Science, Hebei University, Baoding 071002, Hebei Province, P.R.ChinaHebei Key Laboratory of Machine Learning and Computational Intelligence, College of Mathematics and Information Science, Hebei University, Baoding 071002, Hebei Province, P.R.ChinaHebei Key Laboratory of Machine Learning and Computational Intelligence, College of Mathematics and Information Science, Hebei University, Baoding 071002, Hebei Province, P.R.ChinaHebei Key Laboratory of Machine Learning and Computational Intelligence, College of Mathematics and Information Science, Hebei University, Baoding 071002, Hebei Province, P.R.ChinaIn order to better utilize of the semantic information contained in the part of speech of words and the contextual information of unnatural language accompanying the appearance of words, a part of speech weighted multi-modal sentiment analysis model with dynamic adjustment of semantic representation (PM-DS) is proposed. The PM-DS model takes natural language as the main body, and uses bidirectional encoder representation from transformer model, generalized autoregressive pre-training model for language understanding (XLNet) and a robustly optimized BERT pretraining approach (RoBERTa) to embed words into text patterns, respectively. A dynamic semantic adjustment module is created to effectively combine natural language and unnatural language information. The part of speech weighting module is designed to extract the part of speech of words and assigned weights to optimize sentiment discrimination. Comparative experimental results with the current advanced models such as tensor fusion network and low-rank multimodal fusion show that the average absolute errors of PW-DS model on public data sets CMU-MOSI and CMU-MOSEI are 0.607 and 0.510, respectively, and the binary classification accuracies are 89.02% and 86.93%, respectively, which is better than the models in the comparative experiments. The effects of different modules on the model are also analyzed through ablation experiments. The experimental results demonstrate that the proposed model is effective to deal with the problem of multi-modal emotion analysis.https://journal.szu.edu.cn/en/#/digest?ArticleID=2616artificial intelligencemultimodal sentiment analysisdynamical adjustment semanticpart of speech weightingvisualization of multimodal vector positionvisualization of part of speech weights
spellingShingle HUA Qiang
CHEN Zhuo
ZHANG Feng
DONG Chunru
Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation
Shenzhen Daxue xuebao. Ligong ban
artificial intelligence
multimodal sentiment analysis
dynamical adjustment semantic
part of speech weighting
visualization of multimodal vector position
visualization of part of speech weights
title Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation
title_full Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation
title_fullStr Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation
title_full_unstemmed Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation
title_short Part of speech weighted multi-modal emotion analysis model with dynamic adjustment of semantic representation
title_sort part of speech weighted multi modal emotion analysis model with dynamic adjustment of semantic representation
topic artificial intelligence
multimodal sentiment analysis
dynamical adjustment semantic
part of speech weighting
visualization of multimodal vector position
visualization of part of speech weights
url https://journal.szu.edu.cn/en/#/digest?ArticleID=2616
work_keys_str_mv AT huaqiang partofspeechweightedmultimodalemotionanalysismodelwithdynamicadjustmentofsemanticrepresentation
AT chenzhuo partofspeechweightedmultimodalemotionanalysismodelwithdynamicadjustmentofsemanticrepresentation
AT zhangfeng partofspeechweightedmultimodalemotionanalysismodelwithdynamicadjustmentofsemanticrepresentation
AT dongchunru partofspeechweightedmultimodalemotionanalysismodelwithdynamicadjustmentofsemanticrepresentation