Text this: S<sup>2</sup>RCFormer: Spatial-Spectral Residual Cross-Attention Transformer for Multimodal Remote Sensing Data Classification