DeepSeek-R1 outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in bilingual complex ophthalmology reasoning
Purpose: To evaluate the accuracy and reasoning ability of DeepSeek-R1 and three recently released large language models (LLMs) in bilingual complex ophthalmology cases. Methods: A total of 130 multiple-choice questions (MCQs) related to diagnosis (n = 39) and management (n = 91) were collected...
Saved in:
| Main Authors: | Pusheng Xu, Yue Wu, Kai Jin, Xiaolan Chen, Mingguang He, Danli Shi |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Elsevier
2025-08-01
|
| Series: | Advances in Ophthalmology Practice and Research |
| Subjects: | |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2667376225000290 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
-
Evaluation of deepseek, gemini, ChatGPT-4o, and perplexity in responding to salivary gland cancer
by: Ahmed Bashah, et al.
Published: (2025-08-01) -
DeepSeek calls DeepThink: rethinking AI governance and societal paradigm shift
by: WANG Fei-Yue
Published: (2025-03-01) -
DeepSeek calls DeepThink: rethinking AI governance and societal paradigm shift
by: WANG Fei-Yue
Published: (2025-03-01) -
Generative AI in Pragmatics: Assessing the Accuracy of Automated Speech Act Classification in Pinter’s The Birthday Party
by: Tadej Todorović, et al.
Published: (2025-06-01) -
Speculative futures of education: utopian and dystopian scenarios envisioned by Chatgpt, Gemini, and Deepseek
by: Jessie Ming Sin Wong
Published: (2025-08-01)