A Distillation Approach to Transformer-Based Medical Image Classification with Limited Data