Multimodal Retrieval Method for Images and Diagnostic Reports Using Cross-Attention