Uncertainty-aware coarse-to-fine alignment for text-image person retrieval
Abstract Text-to-image person retrieval, a fine-grained cross-modal retrieval problem, aims to search for person images from an image library that match a given textual caption. Existing text-to-image person retrieval methods usually use fixed-point embedding to express the semantics of the two moda...
Saved in:
| Main Authors: | Yifei Deng, Zhengyu Chen, Chenglong Li, Jin Tang |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Springer
2025-04-01
|
| Series: | Visual Intelligence |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s44267-025-00078-x |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
MKNNet: Knowledge-aligned multimodal transformer for information retrieval
by: Xiaoqin Lin, et al.
Published: (2025-08-01) -
DCLMA: Deep correlation learning with multi-modal attention for visual-audio retrieval
by: Jiwei Zhang, et al.
Published: (2025-09-01) -
DI-VTR: Dual inter-modal interaction model for video-text retrieval
by: Jie Guo, et al.
Published: (2024-09-01) -
Determination of the design velocity of the gas flow in coarse and fine filters with varying degrees of contamination in the paint booths
by: V. E. Zinurov, et al.
Published: (2022-12-01) -
Interaction behavior between coarse-particle pyrite and fine-particle pyrite in flotation
by: Xianchen Wang, et al.
Published: (2025-07-01)