Understanding Video Transformers: A Review on Key Strategies for Feature Learning and Performance Optimization

The video transformer model, a deep learning tool relying on the self-attention mechanism, is capable of efficiently capturing and processing spatiotemporal information in videos through effective spatiotemporal modeling, thereby enabling deep analysis and precise understanding of video content. It...

Full description

Saved in:
Bibliographic Details
Main Authors: Nan Chen, Tie Xu, Mingrui Sun, Chenggui Yao, Dongping Yang
Format: Article
Language:English
Published: American Association for the Advancement of Science (AAAS) 2025-01-01
Series:Intelligent Computing
Online Access:https://spj.science.org/doi/10.34133/icomputing.0143
Tags: Add Tag
No Tags, Be the first to tag this record!