Text this: Unsupervised Video Anomaly Detection Using Video Vision Transformer and Adversarial Training