CABAD: A video dataset for benchmarking child aggression recognition