DeepSeek-R1 outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in bilingual complex ophthalmology reasoning

DeepSeek-R1 outperforms Gemini 2.0 Pro, OpenAI o1, and o3-mini in bilingual complex ophthalmology reasoning

Purpose: To evaluate the accuracy and reasoning ability of DeepSeek-R1 and three recently released large language models (LLMs) in bilingual complex ophthalmology cases. Methods: A total of 130 multiple-choice questions (MCQs) related to diagnosis (n = 39) and management (n = 91) were collected...

Full description

Saved in:

Bibliographic Details
Main Authors:	Pusheng Xu, Yue Wu, Kai Jin, Xiaolan Chen, Mingguang He, Danli Shi
Format:	Article
Language:	English
Published:	Elsevier 2025-08-01
Series:	Advances in Ophthalmology Practice and Research
Subjects:	Large language models DeepSeek Gemini OpenAI Clinical decision support Reasoning ability
Online Access:	http://www.sciencedirect.com/science/article/pii/S2667376225000290
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Evaluation of deepseek, gemini, ChatGPT-4o, and perplexity in responding to salivary gland cancer
by: Ahmed Bashah, et al.
Published: (2025-08-01)

DeepSeek calls DeepThink: rethinking AI governance and societal paradigm shift
by: WANG Fei-Yue
Published: (2025-03-01)

DeepSeek calls DeepThink: rethinking AI governance and societal paradigm shift
by: WANG Fei-Yue
Published: (2025-03-01)

Generative AI in Pragmatics: Assessing the Accuracy of Automated Speech Act Classification in Pinter’s The Birthday Party
by: Tadej Todorović, et al.
Published: (2025-06-01)

Speculative futures of education: utopian and dystopian scenarios envisioned by Chatgpt, Gemini, and Deepseek
by: Jessie Ming Sin Wong
Published: (2025-08-01)

Medical reasoning in LLMs: an in-depth analysis of DeepSeek R1
by: Birger Moëll, et al.
Published: (2025-06-01)

AI in conjunctivitis research: assessing ChatGPT and DeepSeek for etiology, intervention, and citation integrity via hallucination rate analysis
by: Muhammad Hasnain, et al.
Published: (2025-08-01)

Assessing the reliability and relevance of DeepSeek in EFL writing evaluation: a generalizability theory approach
by: Huixin Gao, et al.
Published: (2025-06-01)

Large Language Models for Transforming Healthcare: A Perspective on DeepSeek‐R1
by: Jinsong Zhou, et al.
Published: (2025-06-01)

Evaluating ChatGPT and DeepSeek in postdural puncture headache management: a comparative study with international consensus guidelines
by: Jiayi Deng, et al.
Published: (2025-07-01)

DeepSeek vs. ChatGPT: prospects and challenges
by: Inhye Jin, et al.
Published: (2025-06-01)

Pre-operative T-stage discrimination in gallbladder cancer using machine learning and DeepSeek-R1
by: Joongwon Chae, et al.
Published: (2025-08-01)

Advancing Software Vulnerability Detection with Reasoning LLMs: DeepSeek-R1′s Performance and Insights
by: Wenting Qin, et al.
Published: (2025-06-01)

Battle of the artificial intelligence: a comprehensive comparative analysis of DeepSeek and ChatGPT for urinary incontinence-related questions
by: Huawei Cao, et al.
Published: (2025-07-01)

Exploring the Joint Influence of Built Environment Factors on Urban Rail Transit Peak-Hour Ridership Using DeepSeek
by: Zhuorui Wang, et al.
Published: (2025-05-01)

Comparative Efficacy of ChatGPT and DeepSeek in Addressing Patient Queries on Gonarthrosis and Total Knee Arthroplasty
by: Serhat Gurbuz, MD, et al.
Published: (2025-06-01)

Synergizing DeepSeek's artificial intelligence innovations with brain–computer interfaces
by: Canbiao Wu, et al.
Published: (2025-06-01)

OpenAI o1 Large Language Model Outperforms GPT-4o, Gemini 1.5 Flash, and Human Test Takers on Ophthalmology Board–Style Questions
by: Ryan Shean, BA, et al.
Published: (2025-11-01)

Evaluating Handwritten Answers Using DeepSeek: A Comparative Analysis of Deep Learning-Based Assessment
by: Sanskar Bansal, et al.
Published: (2025-08-01)

Cybersecure XAI Algorithm for Generating Recommendations Based on Financial Fundamentals Using DeepSeek
by: Iván García-Magariño, et al.
Published: (2025-05-01)

DeepSeek or ChatGPT: Can brain‐computer interfaces/brain‐inspired computing achieve leapfrog development with large AI models?
by: Long Bai, et al.
Published: (2025-03-01)

AI-driven feedback system: Implementing advanced NLP and openAI for online learning
by: Liberius Sabinus Koe, et al.
Published: (2025-01-01)

Open Source HBIM and OpenAI: Review and New Analyses on LLMs Integration
by: Filippo Diara
Published: (2025-04-01)

Surface and Antimicrobial Properties of Ester-Based Gemini Surfactants
by: Iwona Kowalczyk, et al.
Published: (2025-06-01)

Assessing DeepSeek R1 and ChatGPT 4.5 in Arabic-English literary translation: performance, challenges, and implications
by: Rachid Ed-Dali
Published: (2025-12-01)

A comparative analysis of DeepSeek R1, DeepSeek-R1-Lite, OpenAi o1 Pro, and Grok 3 performance on ophthalmology board-style questions
by: Ryan Shean, et al.
Published: (2025-07-01)

You believe your LLM is not delusional? Think again! a study of LLM hallucination on foundation models under perturbation
by: Anirban Saha, et al.
Published: (2025-05-01)

LegalMind: Agentic AI-Driven Process Optimization and Cost Reduction in Legal Services Using DeepSeek
by: Nidadavolu Venkat Durga Sai Siva Vara Prasad Raju, et al.
Published: (2025-01-01)

Ai-infused Immersion: Cultivating Efl Learners’ Intercultural Sensitivity Through Google’s Gemini Ai Chatbot
by: Amina Sellami
Published: (2025-07-01)

Development of Chiller Plant Models in OpenAI Gym Environment for Evaluating Reinforcement Learning Algorithms
by: Xiangrui Wang, et al.
Published: (2025-04-01)

Performance of ChatGPT-4 Omni and Gemini 1.5 Pro on Ophthalmology-Related Questions in the Turkish Medical Specialty Exam
by: Mehmet Cem Sabaner, et al.
Published: (2025-08-01)

The green algorithm: can sustainability define the winner in the AI race?
by: Sebastián Rivero-Silva, et al.
Published: (2025-07-01)

Developing Frugal Internet of Things with Backpropagation Neural Network for Predicting Impact of Gemini Artificial Intelligence on Student Meditation and Relaxation
by: Chun-Kai Tseng, et al.
Published: (2025-04-01)

Can deepseek and ChatGPT be used in the diagnosis of oral pathologies?
by: Ömer Faruk Kaygisiz, et al.
Published: (2025-04-01)

Google Gemini as a Learning Assistant: Exploring Student Perceptions
by: Majidah, et al.
Published: (2025-05-01)

In vivo evaluation the efficiency of nitazoxanide with cationic Gemini surfactant on Cryptosporidiosis
by: Zeinab Ahmed, et al.
Published: (2023-12-01)

SiAkif-Bots: Gemini AI for Academic Service Chatbots
by: Bunga Laelatul Muna, et al.
Published: (2025-06-01)

Exploration and Prospects of DeepSeek Applications in Engineering Hydrology
by: GAO Zi-xuan, SONG Xin-yi
Published: (2025-08-01)

Comparative analysis of accuracy and completeness in standardized database generation for complex multilingual lung cancer pathological reports: large language model-based assisted diagnosis system vs. DeepSeek, GPT-3.5, and healthcare professionals with varied professional titles, with task load variation assessment among medical staff
by: Hao Hang, et al.
Published: (2025-08-01)

Three novel Gemini amide amphiphilics synthesis, characterization, thermodynamics, surface properties and biological activity
by: M.G. Gab-Allah, et al.
Published: (2023-06-01)