Improving the Readability of Institutional Heart Failure–Related Patient Education Materials Using GPT-4: Observational Study

Abstract BackgroundHeart failure management involves comprehensive lifestyle modifications such as daily weights, fluid and sodium restriction, and blood pressure monitoring, placing additional responsibility on patients and caregivers, with successful adherence often requirin...

Full description

Saved in:

Bibliographic Details
Main Authors:	Ryan C King, Jamil S Samaan, Joseph Haquang, Vishnu Bharani, Samuel Margolis, Nitin Srinivasan, Yuxin Peng, Yee Hui Yeo, Roxana Ghashghaei
Format:	Article
Language:	English
Published:	JMIR Publications 2025-07-01
Series:	JMIR Cardio
Online Access:	https://cardio.jmir.org/2025/1/e68817
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	Abstract BackgroundHeart failure management involves comprehensive lifestyle modifications such as daily weights, fluid and sodium restriction, and blood pressure monitoring, placing additional responsibility on patients and caregivers, with successful adherence often requiring extensive counseling and understandable patient education materials (PEMs). Prior research has shown PEMs related to cardiovascular disease often exceed the American Medical Association’s fifth- to sixth-grade recommended reading level. The large language model (LLM) ChatGPT may be a useful tool for improving PEM readability. ObjectiveWe aim to assess the readability of heart failure–related PEMs from prominent cardiology institutions and evaluate GPT-4’s ability to improve these metrics while maintaining accuracy and comprehensiveness. MethodsA total of 143 heart failure–related PEMs were collected from the websites of the top 10 institutions listed on the 2022‐2023 US News & World Report for “Best Hospitals for Cardiology, Heart & Vascular Surgery.” PEMs were individually entered into GPT-4 (version updated July 20, 2023), preceded by the prompt, “Please explain the following in simpler terms.” Readability was assessed using the Flesch Reading Ease score, Flesch-Kincaid Grade Level (FKGL), Gunning Fog Index, Coleman-Liau Index, Simple Measure of Gobbledygook Index, and Automated Readability Index. The accuracy and comprehensiveness of revised GPT-4 PEMs were assessed by a board-certified cardiologist. ResultsFor 143 institutional heart failure–related PEMs analyzed, the median FKGL was 10.3 (IQR 7.9-13.1; high school sophomore) compared to 7.3 (IQR 6.1-8.5; seventh grade) for GPT-4’s revised PEMs (PP ConclusionsGPT-4 significantly improved the readability of institutional heart failure–related PEMs. The model may be a promising adjunct resource in addition to care provided by a licensed health care professional for patients living with heart failure. Further rigorous testing and validation is needed to investigate its safety, efficacy, and impact on patient health literacy.
ISSN:	2561-1011

Improving the Readability of Institutional Heart Failure–Related Patient Education Materials Using GPT-4: Observational Study

Similar Items