Evaluating GPT- and reasoning-based large language models on Physics Olympiad problems: Surpassing human performance and implications for educational assessment

Large language models (LLMs) are now widely accessible, reaching learners across all educational levels. This development has raised concerns that their use may circumvent essential learning processes and compromise the integrity of established assessment formats. In physics education, where problem...

Full description

Saved in:

Bibliographic Details
Main Authors:	Paul Tschisgale, Holger Maus, Fabian Kieser, Ben Kroehs, Stefan Petersen, Peter Wulff
Format:	Article
Language:	English
Published:	American Physical Society 2025-08-01
Series:	Physical Review Physics Education Research
Online Access:	http://doi.org/10.1103/6fmx-bsnl
Tags:	Add Tag No Tags, Be the first to tag this record!

Be the first to leave a comment!

Evaluating GPT- and reasoning-based large language models on Physics Olympiad problems: Surpassing human performance and implications for educational assessment

Similar Items