Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social Media

In the rapidly evolving landscape of social media, the demand for precise sentiment analysis (SA) on multimodal data has become increasingly pivotal. This paper introduces a sophisticated data lake architecture tailored for efficient multimodal emotion feature extraction, addressing the challenges p...

Full description

Saved in:
Bibliographic Details
Main Authors: Yuanyuan Fan, Xifeng Mi
Format: Article
Language:English
Published: Wiley 2024-01-01
Series:IET Software
Online Access:http://dx.doi.org/10.1049/2024/6819714
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850216699507245056
author Yuanyuan Fan
Xifeng Mi
author_facet Yuanyuan Fan
Xifeng Mi
author_sort Yuanyuan Fan
collection DOAJ
description In the rapidly evolving landscape of social media, the demand for precise sentiment analysis (SA) on multimodal data has become increasingly pivotal. This paper introduces a sophisticated data lake architecture tailored for efficient multimodal emotion feature extraction, addressing the challenges posed by diverse data types. The proposed framework encompasses a robust storage solution and an innovative SA model, multilevel spatial attention fusion (MLSAF), adept at handling text and visual data concurrently. The data lake architecture comprises five layers, facilitating real-time and offline data collection, storage, processing, standardized interface services, and data mining analysis. The MLSAF model, integrated into the data lake architecture, utilizes a novel approach to SA. It employs a text-guided spatial attention mechanism, fusing textual and visual features to discern subtle emotional interplays. The model’s end-to-end learning approach and attention modules contribute to its efficacy in capturing nuanced sentiment expressions. Empirical evaluations on established multimodal sentiment datasets, MVSA-Single and MVSA-Multi, validate the proposed methodology’s effectiveness. Comparative analyses with state-of-the-art models showcase the superior performance of our approach, with an accuracy improvement of 6% on MVSA-Single and 1.6% on MVSA-Multi. This research significantly contributes to optimizing SA in social media data by offering a versatile and potent framework for data management and analysis. The integration of MLSAF with a scalable data lake architecture presents a strategic innovation poised to navigate the evolving complexities of social media data analytics.
format Article
id doaj-art-368fbb2b207046d59d846a8780fc895c
institution OA Journals
issn 1751-8814
language English
publishDate 2024-01-01
publisher Wiley
record_format Article
series IET Software
spelling doaj-art-368fbb2b207046d59d846a8780fc895c2025-08-20T02:08:14ZengWileyIET Software1751-88142024-01-01202410.1049/2024/6819714Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social MediaYuanyuan Fan0Xifeng Mi1Jiaozuo Normal CollegeJiaozuo Normal CollegeIn the rapidly evolving landscape of social media, the demand for precise sentiment analysis (SA) on multimodal data has become increasingly pivotal. This paper introduces a sophisticated data lake architecture tailored for efficient multimodal emotion feature extraction, addressing the challenges posed by diverse data types. The proposed framework encompasses a robust storage solution and an innovative SA model, multilevel spatial attention fusion (MLSAF), adept at handling text and visual data concurrently. The data lake architecture comprises five layers, facilitating real-time and offline data collection, storage, processing, standardized interface services, and data mining analysis. The MLSAF model, integrated into the data lake architecture, utilizes a novel approach to SA. It employs a text-guided spatial attention mechanism, fusing textual and visual features to discern subtle emotional interplays. The model’s end-to-end learning approach and attention modules contribute to its efficacy in capturing nuanced sentiment expressions. Empirical evaluations on established multimodal sentiment datasets, MVSA-Single and MVSA-Multi, validate the proposed methodology’s effectiveness. Comparative analyses with state-of-the-art models showcase the superior performance of our approach, with an accuracy improvement of 6% on MVSA-Single and 1.6% on MVSA-Multi. This research significantly contributes to optimizing SA in social media data by offering a versatile and potent framework for data management and analysis. The integration of MLSAF with a scalable data lake architecture presents a strategic innovation poised to navigate the evolving complexities of social media data analytics.http://dx.doi.org/10.1049/2024/6819714
spellingShingle Yuanyuan Fan
Xifeng Mi
Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social Media
IET Software
title Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social Media
title_full Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social Media
title_fullStr Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social Media
title_full_unstemmed Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social Media
title_short Design and Efficacy of a Data Lake Architecture for Multimodal Emotion Feature Extraction in Social Media
title_sort design and efficacy of a data lake architecture for multimodal emotion feature extraction in social media
url http://dx.doi.org/10.1049/2024/6819714
work_keys_str_mv AT yuanyuanfan designandefficacyofadatalakearchitectureformultimodalemotionfeatureextractioninsocialmedia
AT xifengmi designandefficacyofadatalakearchitectureformultimodalemotionfeatureextractioninsocialmedia