Text this: Multimodal sentiment analysis based on multi-layer feature fusion and multi-task learning