Large language models underperform in European general surgery board examinations: a comparative study with experts and surgical residents

Large language models underperform in European general surgery board examinations: a comparative study with experts and surgical residents

Abstract Background Artificial intelligence (AI) has become a transformative tool in medical education and assessment. Despite advancements, AI models such as GPT-4o demonstrate variable performance on high-stakes examinations. This study compared the performance of four AI models (Llama-3, Gemini,...

Full description

Saved in:

Bibliographic Details
Main Author:	Melih Can Gül
Format:	Article
Language:	English
Published:	BMC 2025-08-01
Series:	BMC Medical Education
Subjects:	Artificial intelligence Board examinations Human-AI comparison Medical education Surgical training
Online Access:	https://doi.org/10.1186/s12909-025-07856-7
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The impact of a targeted Arab Board of Emergency Medicine examination preparation course on resident success rates
by: Shahzad Anjum, et al.
Published: (2025-02-01)

Can AI pass the written European Board Examination in Neurological Surgery? - Ethical and practical issues
by: Felix C. Stengel, et al.
Published: (2024-01-01)

Strategy Committees on European Public Boards: Characteristics and Tasks
by: Małgorzata Kuczara
Published: (2024-01-01)

Determinants of performance in the board examination for Mechanical Engineering graduates of the Nueva Vizcaya State University, Bambang campus
by: Dale Mark N. Bristol, et al.
Published: (2023-12-01)

PROBATIVE VALUE OF HIGHER FORENSIC MEDICAL BOARD”S OPINIONS
by: Alina-Marilena ŢUCĂ
Published: (2021-05-01)

Legitimacy of the Notary Inspection Board within the Indonesian Legal Framework
by: Ahmad Muhajir Firrizqi Mubaroq, et al.
Published: (2023-12-01)

Impact of Board Incentives and Board Interlocks on Audit Fees
by: Elham Chenari
Published: (2020-08-01)

Problems of Commissioning of Technical Expert Examination of Documents
by: T. F. Bezsonna, et al.
Published: (2021-03-01)

Into the revolution of generative artificial intelligence to develop seasonal fashion mood boards
by: Jannat Rakeya, et al.
Published: (2025-01-01)

Limitation of the General Assembly’s Authority to Dismiss the Members of the Board of Directors
by: Veliye Yanlı, et al.
Published: (2023-06-01)

Hybrid inspection of printed board defects
by: V. V. Vengerenko, et al.
Published: (2024-09-01)

Board Diversity and Environmental Performance with a Focus on Moderating Effect of Board Independence
by: Sohrab Osta, et al.
Published: (2025-07-01)

Editorial Board
by: Elektronika ir Elektrotechnika
Published: (2025-04-01)

Editorial Board
by: JT ADMIN
Published: (2021-03-01)

Editorial board
by: JT ADMIN
Published: (2020-12-01)

Editorial Board
by: Artūras Štikonas
Published: (2023-12-01)

In the boardroom spotlight: Examining the relationship between board size and profitability in Turkish family businesses with independent directors as key players
by: Kamran Jannatli
Published: (2024-09-01)

Why bother: usefulness and effect of young surgeon committees in surgical societies
by: Lutz Brigitta M., et al.
Published: (2018-12-01)

Artificial Intelligence vs. Human Cognition: A Comparative Analysis of ChatGPT and Candidates Sitting the European Board of Ophthalmology Diploma Examination
by: Anna P. Maino, et al.
Published: (2025-04-01)

Board of Commissioners’ Relationship and Climate Change Disclosure: Evidence from Mining Companies
by: Kurnia Rina Ariani, et al.
Published: (2023-09-01)

Editorial Board
by: Artūras Štikonas
Published: (2024-01-01)

Editorial Board, Vol. 4, No. 1
by: Osamah Sabah Barrak
Published: (2022-03-01)

Board Characteristics Best Practices and Financial Performance. Evidence from the European Capital Market
by: Victor-Octavian Müller, et al.
Published: (2014-05-01)

The impact of emergency department crowding and patient boarding on resident point‐of‐care ultrasound education
by: Brandon Michael Wubben, et al.
Published: (2024-06-01)

Spine deformity board and the need for a multidisciplinary discussion of complex spine surgery cases: A proposal from the EANS young neurosurgeons committee
by: Luca Ricciardi, et al.
Published: (2024-01-01)

The role of media exposure on board capital and carbon emission disclosure
by: Mohammad Syafik, et al.
Published: (2025-04-01)

The effect of board of directors’ characteristics on disclosing tone in the annual reports: evidence from Amman stock exchange
by: Salah Kayed, et al.
Published: (2024-10-01)

Influence of gamification on skill-based training of surgical residents
by: Damla Topalli, et al.
Published: (2025-02-01)

Assessing ChatGPT-4’s Capabilities in Generating Dermatology Board Examination Content: An Explorational Study
by: Jonathan Shapiro, et al.
Published: (2025-01-01)

Cataract surgical training in Poland: analysis of the European board of ophthalmology survey results
by: Rémi Yaïci, et al.
Published: (2025-05-01)

Examiner workload comparison: three structured oral examination formats for the European diploma in anaesthesiology and intensive care
by: Mikhail Dziadzko, et al.
Published: (2024-12-01)

On Actual Issues of Interaction of Expert Community and Power: Conceptualization of a Role of Public Experts in Formation of the Agenda of the State
by: Marina Vasilyevna Noskova
Published: (2018-04-01)

Evaluation of Chat Generative Pre-trained Transformer and Microsoft Copilot Performance on the American Society of Surgery of the Hand Self-Assessment Examinations
by: Taylor R. Rakauskas, BS, et al.
Published: (2025-01-01)

Board Structure and the Profitability of Listed Consumer Goods Firms in Nigeria
by: Yusufu, Ojochenemi Sunday, et al.
Published: (2023-12-01)

Demarcating a Linguistic Expert’s and an Authorship Investigator’s Competencies When Examining Copyright and Related Rights Objects
by: V. O. Kuznetsov, et al.
Published: (2019-10-01)

The Mediating Effects of Board Role Performance in the Relationship Between Board Capital and Survival of Financial Cooperatives in Uganda
by: Francis Yosa, et al.
Published: (2024-11-01)

Examine the application extent of periodic medical examinations for pregnant women in Al-Abassia general hospital in Al-Najaf Al-Ashraf province
by: Atheer Kadhim Ibadi
Published: (2011-12-01)

The Influence of Majority Ownership, Profitability, Size of the Board of Directors, and Frequency of Board of Commissioners Meetings on Sustainability Report Disclosure
by: Rina Trisnawati, et al.
Published: (2022-04-01)

RESULTS OF HALF-CENTURY SURGICAL TREATMENT OF HALLUX VALGUS ASSOCIATED WITH TRANSVERSE FLAT FEET
by: E.V. KHALIMOV, T.S. BARANOVA, A.YU. MIKHAILOV, A.I. ZAKIRYANOVA, A.R. SHAGEEVA
Published: (2025-07-01)

The board as an example of Japanese corporate governance system hybridization: An outline of the problem
by: Magdalena Jerzemowska, et al.
Published: (2020-09-01)