Text this: Enhancing Medical Image Classification With Context Modulated Attention and Multi-Scale Feature Fusion