Uncertainty-aware coarse-to-fine alignment for text-image person retrieval

Abstract Text-to-image person retrieval, a fine-grained cross-modal retrieval problem, aims to search for person images from an image library that match a given textual caption. Existing text-to-image person retrieval methods usually use fixed-point embedding to express the semantics of the two moda...

Bibliographic Details
Main Authors:	Yifei Deng, Zhengyu Chen, Chenglong Li, Jin Tang
Format:	Article
Language:	English
Published:	Springer 2025-04-01
Series:	Visual Intelligence
Subjects:	Cross-modal retrieval; Uncertainty-aware; Coarse-to-fine alignment; Probability distribution; Contrastive learning
Online Access:	https://doi.org/10.1007/s44267-025-00078-x
