Cross-modal learning with multi-modal model for video action recognition based on adaptive weight training

The canonical video action recognition methods usually label categories with numbers or one-hot vectors and train neural networks to classify a fixed set of predefined categories, thereby constraining their ability to recognise complex actions and transferable ability to unseen concepts. In contrast...

Full description

Saved in:
Bibliographic Details
Main Authors: Qingguo Zhou, Yufeng Hou, Rui Zhou, Yan Li, JinQiang Wang, Zhen Wu, Hung-Wei Li, Tien-Hsiung Weng
Format: Article
Language:English
Published: Taylor & Francis Group 2024-12-01
Series:Connection Science
Subjects:
Online Access:https://www.tandfonline.com/doi/10.1080/09540091.2024.2325474
Tags: Add Tag
No Tags, Be the first to tag this record!