Infrared and Visible Image Fusion via Residual Interactive Transformer and Cross-Attention Fusion