Text this: Real-Time Aerial Multispectral Object Detection with Dynamic Modality-Balanced Pixel-Level Fusion