Assessing the quality of automatic-generated short answers using GPT-4