Text this: Novel Advance Image Caption Generation Utilizing Vision Transformer and Generative Adversarial Networks