Text this: Collaborative Joint Perception and Prediction for Autonomous Driving