Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific Pathologies

Abstract BackgroundThe global incidence of blindness has continued to increase, despite the enactment of a Global Eye Health Action Plan by the World Health Assembly. This can be attributed, in part, to an aging population, but also to the limited diagnostic resources within l...

Full description

Saved in:
Bibliographic Details
Main Authors: Shona Alex Tapiwa M'gadzah, Andrew O'Malley
Format: Article
Language:English
Published: JMIR Publications 2025-06-01
Series:JMIR Formative Research
Online Access:https://formative.jmir.org/2025/1/e64986
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850104698050183168
author Shona Alex Tapiwa M'gadzah
Andrew O'Malley
author_facet Shona Alex Tapiwa M'gadzah
Andrew O'Malley
author_sort Shona Alex Tapiwa M'gadzah
collection DOAJ
description Abstract BackgroundThe global incidence of blindness has continued to increase, despite the enactment of a Global Eye Health Action Plan by the World Health Assembly. This can be attributed, in part, to an aging population, but also to the limited diagnostic resources within low- and middle-income countries (LMICs). The advent of generative artificial intelligence (AI) within health care could pose a novel solution to combating the prevalence of blindness globally. ObjectiveThe objectives of this study are to quantify the effect the addition of a complex prompt has on the diagnostic accuracy of a commercially available LLM, and to assess whether such LLMs are better or worse at diagnosing conditions that are more prevalent in LMICs. MethodsTen clinical vignettes representing globally and LMIC-prevalent ophthalmological conditions were presented to GPT-4‐0125-preview using simple and complex prompts. Diagnostic performance metrics, including sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV), were calculated. Statistical comparison between prompts was conducted using a chi-square test of independence. ResultsThe complex prompt achieved a higher diagnostic accuracy (90.1%) compared to the simple prompt (60.4%), with a statistically significant difference (χ2P ConclusionsThe study established that overall, the inclusion of a complex prompt positively affected the diagnostic accuracy of GPT-4‐0125-preview, particularly for LMIC-prevalent conditions. This highlights the potential for LLMs, when appropriately tailored, to support clinicians in diverse health care settings. Future research should explore the generalizability of these findings across other models and specialties.
format Article
id doaj-art-38fe718e18944f7c93a72d79424ee620
institution DOAJ
issn 2561-326X
language English
publishDate 2025-06-01
publisher JMIR Publications
record_format Article
series JMIR Formative Research
spelling doaj-art-38fe718e18944f7c93a72d79424ee6202025-08-20T02:39:16ZengJMIR PublicationsJMIR Formative Research2561-326X2025-06-019e64986e6498610.2196/64986Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific PathologiesShona Alex Tapiwa M'gadzahhttp://orcid.org/0009-0001-7804-0788Andrew O'Malleyhttp://orcid.org/0000-0001-7725-4082 Abstract BackgroundThe global incidence of blindness has continued to increase, despite the enactment of a Global Eye Health Action Plan by the World Health Assembly. This can be attributed, in part, to an aging population, but also to the limited diagnostic resources within low- and middle-income countries (LMICs). The advent of generative artificial intelligence (AI) within health care could pose a novel solution to combating the prevalence of blindness globally. ObjectiveThe objectives of this study are to quantify the effect the addition of a complex prompt has on the diagnostic accuracy of a commercially available LLM, and to assess whether such LLMs are better or worse at diagnosing conditions that are more prevalent in LMICs. MethodsTen clinical vignettes representing globally and LMIC-prevalent ophthalmological conditions were presented to GPT-4‐0125-preview using simple and complex prompts. Diagnostic performance metrics, including sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV), were calculated. Statistical comparison between prompts was conducted using a chi-square test of independence. ResultsThe complex prompt achieved a higher diagnostic accuracy (90.1%) compared to the simple prompt (60.4%), with a statistically significant difference (χ2P ConclusionsThe study established that overall, the inclusion of a complex prompt positively affected the diagnostic accuracy of GPT-4‐0125-preview, particularly for LMIC-prevalent conditions. This highlights the potential for LLMs, when appropriately tailored, to support clinicians in diverse health care settings. Future research should explore the generalizability of these findings across other models and specialties.https://formative.jmir.org/2025/1/e64986
spellingShingle Shona Alex Tapiwa M'gadzah
Andrew O'Malley
Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific Pathologies
JMIR Formative Research
title Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific Pathologies
title_full Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific Pathologies
title_fullStr Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific Pathologies
title_full_unstemmed Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific Pathologies
title_short Enhancing Diagnostic Accuracy of Ophthalmological Conditions With Complex Prompts in GPT-4: Comparative Analysis of Global and Low- and Middle-Income Country (LMIC)–Specific Pathologies
title_sort enhancing diagnostic accuracy of ophthalmological conditions with complex prompts in gpt 4 comparative analysis of global and low and middle income country lmic specific pathologies
url https://formative.jmir.org/2025/1/e64986
work_keys_str_mv AT shonaalextapiwamgadzah enhancingdiagnosticaccuracyofophthalmologicalconditionswithcomplexpromptsingpt4comparativeanalysisofglobalandlowandmiddleincomecountrylmicspecificpathologies
AT andrewomalley enhancingdiagnosticaccuracyofophthalmologicalconditionswithcomplexpromptsingpt4comparativeanalysisofglobalandlowandmiddleincomecountrylmicspecificpathologies