dpVision: Environment for multimodal images