Text this: The QCML dataset, Quantum chemistry reference data from 33.5M DFT and 14.7B semi-empirical calculations