Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts

Clear and accessible lay summaries are essential for enhancing the public understanding of scientific knowledge. This study aimed to evaluate whether ChatGPT-4 can generate high-quality lay summaries that are both accurate and comprehensible for prostate cancer research in <i>Current Oncology&...

Full description

Saved in:

Bibliographic Details
Main Authors:	Emily Rinderknecht, Anna Schmelzer, Anton Kravchuk, Christopher Goßler, Johannes Breyer, Christian Gilfrich, Maximilian Burger, Simon Engelmann, Veronika Saberi, Clemens Kirschner, Dominik von Winning, Roman Mayr, Christian Wülfing, Hendrik Borgmann, Stephan Buse, Maximilian Haas, Matthias May
Format:	Article
Language:	English
Published:	MDPI AG 2025-02-01
Series:	Current Oncology
Subjects:	patient communication artificial intelligence in healthcare language model applications plain language summaries lay abstracts prompt design
Online Access:	https://www.mdpi.com/1718-7729/32/2/102
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1849720155471347712
author	Emily Rinderknecht Anna Schmelzer Anton Kravchuk Christopher Goßler Johannes Breyer Christian Gilfrich Maximilian Burger Simon Engelmann Veronika Saberi Clemens Kirschner Dominik von Winning Roman Mayr Christian Wülfing Hendrik Borgmann Stephan Buse Maximilian Haas Matthias May
author_facet	Emily Rinderknecht Anna Schmelzer Anton Kravchuk Christopher Goßler Johannes Breyer Christian Gilfrich Maximilian Burger Simon Engelmann Veronika Saberi Clemens Kirschner Dominik von Winning Roman Mayr Christian Wülfing Hendrik Borgmann Stephan Buse Maximilian Haas Matthias May
author_sort	Emily Rinderknecht
collection	DOAJ
description	Clear and accessible lay summaries are essential for enhancing the public understanding of scientific knowledge. This study aimed to evaluate whether ChatGPT-4 can generate high-quality lay summaries that are both accurate and comprehensible for prostate cancer research in <i>Current Oncology</i>. To achieve this, it systematically assessed ChatGPT-4’s ability to summarize 80 prostate cancer articles published in the journal between July 2022 and June 2024 using two distinct prompt designs: a basic “simple” prompt and an enhanced “extended” prompt. Readability was assessed using established metrics, including the Flesch–Kincaid Reading Ease (FKRE), while content quality was evaluated with a 5-point Likert scale for alignment with source material. The extended prompt demonstrated significantly higher readability (median FKRE: 40.9 vs. 29.1, <i>p</i> < 0.001), better alignment with quality thresholds (86.2% vs. 47.5%, <i>p</i> < 0.001), and reduced the required reading level, making content more accessible. Both prompt designs produced content with high comprehensiveness (median Likert score: 5). This study highlights the critical role of tailored prompt engineering in optimizing large language models (LLMs) for medical communication. Limitations include the exclusive focus on prostate cancer, the use of predefined prompts without iterative refinement, and the absence of a direct comparison with human-crafted summaries. These findings underscore the transformative potential of LLMs like ChatGPT-4 to streamline the creation of lay summaries, reduce researchers’ workload, and enhance public engagement. Future research should explore prompt variability, incorporate patient feedback, and extend applications across broader medical domains.
format	Article
id	doaj-art-76ade3d74da14f979cff89e3dafaa87f
institution	DOAJ
issn	1198-0052 1718-7729
language	English
publishDate	2025-02-01
publisher	MDPI AG
record_format	Article
series	Current Oncology
spelling	doaj-art-76ade3d74da14f979cff89e3dafaa87f2025-08-20T03:12:00ZengMDPI AGCurrent Oncology1198-00521718-77292025-02-0132210210.3390/curroncol32020102Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer ManuscriptsEmily Rinderknecht0Anna Schmelzer1Anton Kravchuk2Christopher Goßler3Johannes Breyer4Christian Gilfrich5Maximilian Burger6Simon Engelmann7Veronika Saberi8Clemens Kirschner9Dominik von Winning10Roman Mayr11Christian Wülfing12Hendrik Borgmann13Stephan Buse14Maximilian Haas15Matthias May16Department of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyDepartment of Urology, St. Elisabeth Hospital Straubing, 94315 Straubing, GermanyDepartment of Urology, St. Elisabeth Hospital Straubing, 94315 Straubing, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyDepartment of Urology, St. Elisabeth Hospital Straubing, 94315 Straubing, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyDepartment of Urology, St. Elisabeth Hospital Straubing, 94315 Straubing, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyWorking Group on Artificial Intelligence and Digitalization of the German Society of UrologyWorking Group on Artificial Intelligence and Digitalization of the German Society of UrologyDepartment of Urology, Alfried Krupp Krankenhaus, 45131 Essen, GermanyDepartment of Urology, St. Josef Medical Center, University of Regensburg, 93053 Regensburg, GermanyWorking Group on Artificial Intelligence and Digitalization of the German Society of UrologyClear and accessible lay summaries are essential for enhancing the public understanding of scientific knowledge. This study aimed to evaluate whether ChatGPT-4 can generate high-quality lay summaries that are both accurate and comprehensible for prostate cancer research in <i>Current Oncology</i>. To achieve this, it systematically assessed ChatGPT-4’s ability to summarize 80 prostate cancer articles published in the journal between July 2022 and June 2024 using two distinct prompt designs: a basic “simple” prompt and an enhanced “extended” prompt. Readability was assessed using established metrics, including the Flesch–Kincaid Reading Ease (FKRE), while content quality was evaluated with a 5-point Likert scale for alignment with source material. The extended prompt demonstrated significantly higher readability (median FKRE: 40.9 vs. 29.1, <i>p</i> < 0.001), better alignment with quality thresholds (86.2% vs. 47.5%, <i>p</i> < 0.001), and reduced the required reading level, making content more accessible. Both prompt designs produced content with high comprehensiveness (median Likert score: 5). This study highlights the critical role of tailored prompt engineering in optimizing large language models (LLMs) for medical communication. Limitations include the exclusive focus on prostate cancer, the use of predefined prompts without iterative refinement, and the absence of a direct comparison with human-crafted summaries. These findings underscore the transformative potential of LLMs like ChatGPT-4 to streamline the creation of lay summaries, reduce researchers’ workload, and enhance public engagement. Future research should explore prompt variability, incorporate patient feedback, and extend applications across broader medical domains.https://www.mdpi.com/1718-7729/32/2/102patient communicationartificial intelligence in healthcarelanguage model applicationsplain language summarieslay abstractsprompt design
spellingShingle	Emily Rinderknecht Anna Schmelzer Anton Kravchuk Christopher Goßler Johannes Breyer Christian Gilfrich Maximilian Burger Simon Engelmann Veronika Saberi Clemens Kirschner Dominik von Winning Roman Mayr Christian Wülfing Hendrik Borgmann Stephan Buse Maximilian Haas Matthias May Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts Current Oncology patient communication artificial intelligence in healthcare language model applications plain language summaries lay abstracts prompt design
title	Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts
title_full	Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts
title_fullStr	Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts
title_full_unstemmed	Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts
title_short	Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts
title_sort	leveraging large language models for high quality lay summaries efficacy of chatgpt 4 with custom prompts in a consecutive series of prostate cancer manuscripts
topic	patient communication artificial intelligence in healthcare language model applications plain language summaries lay abstracts prompt design
url	https://www.mdpi.com/1718-7729/32/2/102
work_keys_str_mv	AT emilyrinderknecht leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT annaschmelzer leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT antonkravchuk leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT christophergoßler leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT johannesbreyer leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT christiangilfrich leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT maximilianburger leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT simonengelmann leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT veronikasaberi leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT clemenskirschner leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT dominikvonwinning leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT romanmayr leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT christianwulfing leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT hendrikborgmann leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT stephanbuse leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT maximilianhaas leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts AT matthiasmay leveraginglargelanguagemodelsforhighqualitylaysummariesefficacyofchatgpt4withcustompromptsinaconsecutiveseriesofprostatecancermanuscripts

Leveraging Large Language Models for High-Quality Lay Summaries: Efficacy of ChatGPT-4 with Custom Prompts in a Consecutive Series of Prostate Cancer Manuscripts

Similar Items