Text this: Apvit: ViT with adaptive patches for scene text recognition