Autonomous medical evaluation for guideline adherence of large language models

Abstract Autonomous Medical Evaluation for Guideline Adherence (AMEGA) is a comprehensive benchmark designed to evaluate large language models’ adherence to medical guidelines across 20 diagnostic scenarios spanning 13 specialties. It includes an evaluation framework and methodology to assess models...

Full description

Saved in:
Bibliographic Details
Main Authors: Dennis Fast, Lisa C. Adams, Felix Busch, Conor Fallon, Marc Huppertz, Robert Siepmann, Philipp Prucker, Nadine Bayerl, Daniel Truhn, Marcus Makowski, Alexander Löser, Keno K. Bressem
Format: Article
Language:English
Published: Nature Portfolio 2024-12-01
Series:npj Digital Medicine
Online Access:https://doi.org/10.1038/s41746-024-01356-6
Tags: Add Tag
No Tags, Be the first to tag this record!