Fine-Grained Local and Global Semantic Fusion for Multimodal Image–Text Retrieval

An image–text retrieval method is proposed that integrates intramodal fine-grained local semantic information with intermodal global semantic information, addressing the weak fine-grained discrimination of semantic features between the image and text modalities in cross-modal ret...

Bibliographic Details
Main Authors: Shenao Peng, Zhongmei Wang, Jianhua Liu, Changfan Zhang, Lin Jia
Format: Article
Language: English
Published: MDPI AG 2025-02-01
Series: Big Data and Cognitive Computing
Online Access: https://www.mdpi.com/2504-2289/9/3/53