Autonomous medical evaluation for guideline adherence of large language models
Abstract Autonomous Medical Evaluation for Guideline Adherence (AMEGA) is a comprehensive benchmark designed to evaluate large language models’ adherence to medical guidelines across 20 diagnostic scenarios spanning 13 specialties. It includes an evaluation framework and methodology to assess models...
Saved in:
| Main Authors: | Dennis Fast, Lisa C. Adams, Felix Busch, Conor Fallon, Marc Huppertz, Robert Siepmann, Philipp Prucker, Nadine Bayerl, Daniel Truhn, Marcus Makowski, Alexander Löser, Keno K. Bressem |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2024-12-01
|
| Series: | npj Digital Medicine |
| Online Access: | https://doi.org/10.1038/s41746-024-01356-6 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Current applications and challenges in large language models for patient care: a systematic review
by: Felix Busch, et al.
Published: (2025-01-01) -
Data extraction from free-text stroke CT reports using GPT-4o and Llama-3.3-70B: the impact of annotation guidelines
by: Jonas Wihl, et al.
Published: (2025-06-01) -
Adherence to guidelines in the diagnosis and management of autoimmune hepatitis
by: Roshni Patel, et al.
Published: (2025-07-01) -
Doctors’ Adherence to the Guidelines on the Oral Anticoagulants Usage
by: N. B. Perepech, et al.
Published: (2020-11-01) -
Non-user acceptance of autonomous technology: A survey of bicyclist receptivity to fully autonomous vehicles
by: Peter André Busch
Published: (2024-12-01)