Text this: Multispectral Target Detection Based on Deep Feature Fusion of Visible and Infrared Modalities