Efficient text-to-video retrieval via multi-modal multi-tagger derived pre-screening
Abstract: Text-to-video retrieval (TVR) has made significant progress with advances in vision and language representation learning. Most existing methods use real-valued and hash-based embeddings to represent the video and text, allowing retrieval by computing their similarities. However, these metho...
| Main Authors: | Yingjia Xu, Mengxia Wu, Zixin Guo, Min Cao, Mang Ye, Jorma Laaksonen |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Springer, 2025-03-01 |
| Series: | Visual Intelligence |
| Subjects: | |
| Online Access: | https://doi.org/10.1007/s44267-025-00073-2 |
Similar Items
- Tencent Text-Video Retrieval: Hierarchical Cross-Modal Interactions With Multi-Level Representations
  by: Jie Jiang, et al. Published: (2025-01-01)
- DI-VTR: Dual inter-modal interaction model for video-text retrieval
  by: Jie Guo, et al. Published: (2024-09-01)
- Dialogue-to-Video Retrieval via Multi-Grained Attention Network
  by: Yi Yu, et al. Published: (2025-01-01)
- Hierarchical multi-modal video summarization with dynamic sampling
  by: Lingjian Yu, et al. Published: (2024-12-01)
- Strong and Weak Prompt Engineering for Remote Sensing Image-Text Cross-Modal Retrieval
  by: Tianci Sun, et al. Published: (2025-01-01)