Text this: Multistage Training and Fusion Method for Imbalanced Multimodal UAV Remote Sensing Classification