Diagnostic Performance of Publicly Available Large Language Models in Corneal Diseases: A Comparison with Human Specialists
<b>Background/Objectives:</b> This study evaluated the diagnostic accuracy of seven publicly available large language models (LLMs)—GPT-3.5, GPT-4.o Mini, GPT-4.o, Gemini 1.5 Flash, Claude 3.5 Sonnet, Grok3, and DeepSeek R1—in diagnosing corneal diseases, comparing their performance to h...
Saved in:
| Main Authors: | Cheng Jiao, Erik Rosas, Hassan Asadigandomani, Mohammad Delsoz, Yeganeh Madadi, Hina Raja, Wuqaas M. Munir, Brendan Tamm, Shiva Mehravaran, Ali R. Djalilian, Siamak Yousefi, Mohammad Soleimani |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
MDPI AG
2025-05-01
|
| Series: | Diagnostics |
| Subjects: | |
| Online Access: | https://www.mdpi.com/2075-4418/15/10/1221 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Navigating the integration of ChatGPT in UAE’s government sector: challenges and opportunities
by: Ghada Nabil Goher
Published: (2025-01-01) -
Unge og helseinformasjon<subtitle>ChatGPT vs. fagpersoner</subtitle>
by: Marita Skjuve, et al.
Published: (2025-01-01) -
Avellino corneal dystrophy: corneal stars
by: Laaribi Nisrine, et al.
Published: (2025-08-01) -
ChatGPT nella valutazione dell’elaborato scritto
by: Nizzolino, Salvatore
Published: (2025-04-01) -
Leveraging human-AI collaboration to visualize age-related diabetes features using dataset
by: Yoshiyasu Takefuji
Published: (2024-09-01)