A large synthetic dataset for machine learning applications in power transmission grids