Text this: Enhancing gesture recognition with multiscale feature extraction and spatial attention.