Text this: A Latent Multi-Scale Residual Transformer Approach for Cross-Modal Medical Image Synthesis