Text this: Emotion recognition in panoramic audio and video virtual reality based on deep learning and feature fusion