Uncertainty-aware coarse-to-fine alignment for text-image person retrieval

Uncertainty-aware coarse-to-fine alignment for text-image person retrieval

Abstract Text-to-image person retrieval, a fine-grained cross-modal retrieval problem, aims to search for person images from an image library that match a given textual caption. Existing text-to-image person retrieval methods usually use fixed-point embedding to express the semantics of the two moda...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yifei Deng, Zhengyu Chen, Chenglong Li, Jin Tang
Format:	Article
Language:	English
Published:	Springer 2025-04-01
Series:	Visual Intelligence
Subjects:	Cross-modal retrieval Uncertainty-aware Coarse-to-fine alignment Probalility distribution Contrastive learning
Online Access:	https://doi.org/10.1007/s44267-025-00078-x
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

MKNNet: Knowledge-aligned multimodal transformer for information retrieval
by: Xiaoqin Lin, et al.
Published: (2025-08-01)

DCLMA: Deep correlation learning with multi-modal attention for visual-audio retrieval
by: Jiwei Zhang, et al.
Published: (2025-09-01)

DI-VTR: Dual inter-modal interaction model for video-text retrieval
by: Jie Guo, et al.
Published: (2024-09-01)

Determination of the design velocity of the gas flow in coarse and fine filters with varying degrees of contamination in the paint booths
by: V. E. Zinurov, et al.
Published: (2022-12-01)

Interaction behavior between coarse-particle pyrite and fine-particle pyrite in flotation
by: Xianchen Wang, et al.
Published: (2025-07-01)

A Dual-Enhanced Hierarchical Alignment Framework for Multimodal Named Entity Recognition
by: Jian Wang, et al.
Published: (2025-05-01)

Multi-source point cloud registration for urban areas using a coarse-to-fine approach
by: Eunkwan Lee, et al.
Published: (2024-12-01)

Adaptive Gap-Filling of Multispectral Images at Coarse and Fine Spatial Resolution
by: Seyedkarim Afsharipour, et al.
Published: (2025-01-01)

Fine-Grained Local and Global Semantic Fusion for Multimodal Image–Text Retrieval
by: Shenao Peng, et al.
Published: (2025-02-01)

Mechanical Properties of Similar Materials Simulating Weak Surrounding Rocks with Different Ratios of Fine-to-Coarse Aggregate
by: Lan Cui, et al.
Published: (2025-03-01)

Adaptive Single-Mode Fiber Coupling Method Based on Coarse-Fine Laser Nutation
by: Bo Li, et al.
Published: (2018-01-01)

Application of 4PCS and KD-ICP Alignment Methods Based on ISS Feature Points for Rail Wear Detection
by: Jie Shan, et al.
Published: (2025-03-01)

Cross modal recipe retrieval with fine grained modal interaction
by: Fan Zhao, et al.
Published: (2025-02-01)

Graph-Based Hierarchical Semantic Consistency Network for Remote Sensing Image–Text Retrieval
by: Meiting Wang, et al.
Published: (2025-01-01)

FRORS: An Effective Fine-Grained Retrieval Framework for Optical Remote Sensing Images
by: Yong-Qiang Mao, et al.
Published: (2025-01-01)

Stress-strain relationship of steel fiber reinforced fully recycled coarse/fine aggregate concrete under cyclic loading
by: Shuqi Guo, et al.
Published: (2025-07-01)

Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions With Multi-Level Representations
by: Jie Jiang, et al.
Published: (2025-01-01)

Medical LLMs: Fine-Tuning vs. Retrieval-Augmented Generation
by: Bhagyajit Pingua, et al.
Published: (2025-06-01)

Coarse and Fine Two-stage Fast Adjustment of the Ring Platform for Reactor Coolant Pump Bolt Detection based on Decoupling Parallel Mechanism
by: Shisen Lin, et al.
Published: (2021-10-01)

Multi-level fusion with fine-grained alignment for multimodal sentiment analysis
by: Xiaoge Li, et al.
Published: (2025-06-01)

In-Motion Forward–Forward Backtracking Fine Alignment Based on Displacement Observation for SINS/GNSS
by: Yongyun Zhu, et al.
Published: (2024-12-01)

Unsupervised Contrastive Graph Kolmogorov–Arnold Networks Enhanced Cross-Modal Retrieval Hashing
by: Hongyu Lin, et al.
Published: (2025-06-01)

Cross-Modality Consistency Network for Remote Sensing Text-Image Retrieval
by: Yuchen Sha, et al.
Published: (2025-01-01)

PR-CLIP: Cross-Modal Positional Reconstruction for Remote Sensing Image–Text Retrieval
by: Jihong Guan, et al.
Published: (2025-06-01)

Strong and Weak Prompt Engineering for Remote Sensing Image-Text Cross-Modal Retrieval
by: Tianci Sun, et al.
Published: (2025-01-01)

Exploring latent weight factors and global information for food-oriented cross-modal retrieval
by: Wenyu Zhao, et al.
Published: (2023-12-01)

On asymorphisms of finitary coarse spaces
by: I. V. Protasov
Published: (2021-12-01)

Choice Vectors: Streamlining Personal AI Alignment Through Binary Selection
by: Eleanor Watson, et al.
Published: (2025-03-01)

Fine-Tuning Retrieval-Augmented Generation with an Auto-Regressive Language Model for Sentiment Analysis in Financial Reviews
by: Miehleketo Mathebula, et al.
Published: (2024-11-01)

Multi-pattern time-aware sequential recommendation with data augmentation
by: LI Jiale, et al.
Published: (2024-11-01)

CADFormer: Fine-Grained Cross-Modal Alignment and Decoding Transformer for Referring Remote Sensing Image Segmentation
by: Maofu Liu, et al.
Published: (2025-01-01)

The Finite Coarse Shape Paths
by: Ivan Jelić, et al.
Published: (2025-01-01)

Efficient text-to-video retrieval via multi-modal multi-tagger derived pre-screening
by: Yingjia Xu, et al.
Published: (2025-03-01)

Land Surface Condition-Driven Emissivity Variation and Its Impact on Diurnal Land Surface Temperature Retrieval Uncertainty
by: Lijuan Wang, et al.
Published: (2025-07-01)

TAMC: Textual Alignment and Masked Consistency for Open-Vocabulary 3D Scene Understanding
by: Juan Wang, et al.
Published: (2024-09-01)

Finitary approximations of coarse structures
by: I. V. Protasov
Published: (2021-03-01)

Weighted Multi-Modal Contrastive Learning Based Hybrid Network for Alzheimer’s Disease Diagnosis
by: Renping Yu, et al.
Published: (2025-01-01)

Quantifying Uncertainty in Flood Predictions in Fixed Cartesian Flood Model Due To Arbitrary Conventions in Grid Alignment
by: M. Nguyen, et al.
Published: (2025-05-01)

AN EXPERIMENTAL STUDY OF THE COARSE DROPLETS FORMATION
by: Ondřej Bartoš, et al.
Published: (2018-12-01)

An Image-Based Alignment Errors Correction Method for Segmented Fresnel Primary Mirror
by: Licheng Zhu, et al.
Published: (2020-01-01)