Towards a multi-modal Deep Learning Architecture for User Modeling