Evaluating multiple large language models on orbital diseases