Text this: Measuring Perceptual and Linguistic Complexity in Multilingual Grounded Language Data