Cross-modal learning with multi-modal model for video action recognition based on adaptive weight training

The canonical video action recognition methods usually label categories with numbers or one-hot vectors and train neural networks to classify a fixed set of predefined categories, thereby constraining their ability to recognise complex actions and transferable ability to unseen concepts. In contrast...

Full description

Saved in:

Bibliographic Details
Main Authors:	Qingguo Zhou, Yufeng Hou, Rui Zhou, Yan Li, JinQiang Wang, Zhen Wu, Hung-Wei Li, Tien-Hsiung Weng
Format:	Article
Language:	English
Published:	Taylor & Francis Group 2024-12-01
Series:	Connection Science
Subjects:	Adaptive weight training cross-modal learning video action recognition vision-Language adaptation
Online Access:	https://www.tandfonline.com/doi/10.1080/09540091.2024.2325474
Tags:	Add Tag No Tags, Be the first to tag this record!

Internet

https://www.tandfonline.com/doi/10.1080/09540091.2024.2325474

Cross-modal learning with multi-modal model for video action recognition based on adaptive weight training

Internet

Similar Items