Text this: Understanding Video Transformers: A Review on Key Strategies for Feature Learning and Performance Optimization