ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial

BackgroundThe increasing development and spread of artificial and assistive intelligence is opening up new areas of application not only in applied medicine but also in related fields such as continuing medical education (CME), which is part of the mandatory training program...

Full description

Saved in:
Bibliographic Details
Main Authors: Christian Burisch, Abhav Bellary, Frank Breuckmann, Jan Ehlers, Serge C Thal, Timur Sellmann, Daniel Gödde
Format: Article
Language:English
Published: JMIR Publications 2025-02-01
Series:JMIR Research Protocols
Online Access:https://www.researchprotocols.org/2025/1/e63887
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1825208900786323456
author Christian Burisch
Abhav Bellary
Frank Breuckmann
Jan Ehlers
Serge C Thal
Timur Sellmann
Daniel Gödde
author_facet Christian Burisch
Abhav Bellary
Frank Breuckmann
Jan Ehlers
Serge C Thal
Timur Sellmann
Daniel Gödde
author_sort Christian Burisch
collection DOAJ
description BackgroundThe increasing development and spread of artificial and assistive intelligence is opening up new areas of application not only in applied medicine but also in related fields such as continuing medical education (CME), which is part of the mandatory training program for medical doctors in Germany. This study aimed to determine whether medical laypersons can successfully conduct training courses specifically for physicians with the help of a large language model (LLM) such as ChatGPT-4. This study aims to qualitatively and quantitatively investigate the impact of using artificial intelligence (AI; specifically ChatGPT) on the acquisition of credit points in German postgraduate medical education. ObjectiveUsing this approach, we wanted to test further possible applications of AI in the postgraduate medical education setting and obtain results for practical use. Depending on the results, the potential influence of LLMs such as ChatGPT-4 on CME will be discussed, for example, as part of a SWOT (strengths, weaknesses, opportunities, threats) analysis. MethodsWe designed a randomized controlled trial, in which adult high school students attempt to solve CME tests across six medical specialties in three study arms in total with 18 CME training courses per study arm under different interventional conditions with varying amounts of permitted use of ChatGPT-4. Sample size calculation was performed including guess probability (20% correct answers, SD=40%; confidence level of 1–α=.95/α=.05; test power of 1–β=.95; P<.05). The study was registered at open scientific framework. ResultsAs of October 2024, the acquisition of data and students to participate in the trial is ongoing. Upon analysis of our acquired data, we predict our findings to be ready for publication as soon as early 2025. ConclusionsWe aim to prove that the advances in AI, especially LLMs such as ChatGPT-4 have considerable effects on medical laypersons’ ability to successfully pass CME tests. The implications that this holds on how the concept of continuous medical education requires reevaluation are yet to be contemplated. Trial RegistrationOSF Registries 10.17605/OSF.IO/MZNUF; https://osf.io/mznuf International Registered Report Identifier (IRRID)PRR1-10.2196/63887
format Article
id doaj-art-e68b379a457946f98334caeb97d0a59a
institution Kabale University
issn 1929-0748
language English
publishDate 2025-02-01
publisher JMIR Publications
record_format Article
series JMIR Research Protocols
spelling doaj-art-e68b379a457946f98334caeb97d0a59a2025-02-06T17:31:33ZengJMIR PublicationsJMIR Research Protocols1929-07482025-02-0114e6388710.2196/63887ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled TrialChristian Burischhttps://orcid.org/0009-0009-9710-7827Abhav Bellaryhttps://orcid.org/0009-0004-8349-3468Frank Breuckmannhttps://orcid.org/0000-0001-7245-8000Jan Ehlershttps://orcid.org/0000-0001-6306-4173Serge C Thalhttps://orcid.org/0000-0002-1222-8729Timur Sellmannhttps://orcid.org/0000-0002-1471-6806Daniel Göddehttps://orcid.org/0000-0002-8430-1411 BackgroundThe increasing development and spread of artificial and assistive intelligence is opening up new areas of application not only in applied medicine but also in related fields such as continuing medical education (CME), which is part of the mandatory training program for medical doctors in Germany. This study aimed to determine whether medical laypersons can successfully conduct training courses specifically for physicians with the help of a large language model (LLM) such as ChatGPT-4. This study aims to qualitatively and quantitatively investigate the impact of using artificial intelligence (AI; specifically ChatGPT) on the acquisition of credit points in German postgraduate medical education. ObjectiveUsing this approach, we wanted to test further possible applications of AI in the postgraduate medical education setting and obtain results for practical use. Depending on the results, the potential influence of LLMs such as ChatGPT-4 on CME will be discussed, for example, as part of a SWOT (strengths, weaknesses, opportunities, threats) analysis. MethodsWe designed a randomized controlled trial, in which adult high school students attempt to solve CME tests across six medical specialties in three study arms in total with 18 CME training courses per study arm under different interventional conditions with varying amounts of permitted use of ChatGPT-4. Sample size calculation was performed including guess probability (20% correct answers, SD=40%; confidence level of 1–α=.95/α=.05; test power of 1–β=.95; P<.05). The study was registered at open scientific framework. ResultsAs of October 2024, the acquisition of data and students to participate in the trial is ongoing. Upon analysis of our acquired data, we predict our findings to be ready for publication as soon as early 2025. ConclusionsWe aim to prove that the advances in AI, especially LLMs such as ChatGPT-4 have considerable effects on medical laypersons’ ability to successfully pass CME tests. The implications that this holds on how the concept of continuous medical education requires reevaluation are yet to be contemplated. Trial RegistrationOSF Registries 10.17605/OSF.IO/MZNUF; https://osf.io/mznuf International Registered Report Identifier (IRRID)PRR1-10.2196/63887https://www.researchprotocols.org/2025/1/e63887
spellingShingle Christian Burisch
Abhav Bellary
Frank Breuckmann
Jan Ehlers
Serge C Thal
Timur Sellmann
Daniel Gödde
ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial
JMIR Research Protocols
title ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial
title_full ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial
title_fullStr ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial
title_full_unstemmed ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial
title_short ChatGPT-4 Performance on German Continuing Medical Education—Friend or Foe (Trick or Treat)? Protocol for a Randomized Controlled Trial
title_sort chatgpt 4 performance on german continuing medical education friend or foe trick or treat protocol for a randomized controlled trial
url https://www.researchprotocols.org/2025/1/e63887
work_keys_str_mv AT christianburisch chatgpt4performanceongermancontinuingmedicaleducationfriendorfoetrickortreatprotocolforarandomizedcontrolledtrial
AT abhavbellary chatgpt4performanceongermancontinuingmedicaleducationfriendorfoetrickortreatprotocolforarandomizedcontrolledtrial
AT frankbreuckmann chatgpt4performanceongermancontinuingmedicaleducationfriendorfoetrickortreatprotocolforarandomizedcontrolledtrial
AT janehlers chatgpt4performanceongermancontinuingmedicaleducationfriendorfoetrickortreatprotocolforarandomizedcontrolledtrial
AT sergecthal chatgpt4performanceongermancontinuingmedicaleducationfriendorfoetrickortreatprotocolforarandomizedcontrolledtrial
AT timursellmann chatgpt4performanceongermancontinuingmedicaleducationfriendorfoetrickortreatprotocolforarandomizedcontrolledtrial
AT danielgodde chatgpt4performanceongermancontinuingmedicaleducationfriendorfoetrickortreatprotocolforarandomizedcontrolledtrial