Large language model evaluation in autoimmune disease clinical questions comparing ChatGPT 4o, Claude 3.5 Sonnet and Gemini 1.5 pro
Abstract Large language models (LLMs) have established a presence in providing medical services to patients and supporting clinical practice for doctors. To explore the ability of LLMs in answering clinical questions related to autoimmune diseases, this study was designed with 65 questions related t...
Saved in:
| Main Authors: | Juntao Ma, Jie Yu, Anran Xie, Taihong Huang, Wenjing Liu, Mengyin Ma, Yue Tao, Fuyu Zang, Qisi Zheng, Wenbo Zhu, Yuxin Chen, Mingzhe Ning, Yijia Zhu |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Nature Portfolio
2025-05-01
|
| Series: | Scientific Reports |
| Subjects: | |
| Online Access: | https://doi.org/10.1038/s41598-025-02601-y |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Accuracy of ChatGPT-3.5, ChatGPT-4o, Copilot, Gemini, Claude, and Perplexity in advising on lumbosacral radicular pain against clinical practice guidelines: cross-sectional study
by: Giacomo Rossettini, et al.
Published: (2025-06-01) -
Evaluating LLMs for Code Generation in HRI: A Comparative Study of ChatGPT, Gemini, and Claude
by: Andrei Sobo, et al.
Published: (2025-12-01) -
Performance of Large Language Models in Recognizing Brain MRI Sequences: A Comparative Analysis of ChatGPT-4o, Claude 4 Opus, and Gemini 2.5 Pro
by: Ali Salbas, et al.
Published: (2025-07-01) -
Comparative analysis of ChatGPT 3.5 and ChatGPT 4 obstetric and gynecological knowledge
by: Franciszek Ługowski, et al.
Published: (2025-07-01) -
Capabilities of ChatGPT-3.5 as a Urological Triage System
by: Christopher Hirtsiefer, et al.
Published: (2024-12-01)