Evaluation of large language models in patient education and clinical decision support for rotator cuff injury: a two-phase benchmarking study

Abstract: Objective: This study evaluates the accuracy of ChatGPT-4o, ChatGPT-o1, Gemini, and ERNIE Bot in answering rotator cuff injury questions and responding to patients. Results show Gemini excels in accuracy, while ChatGPT-4o performs better in patient interactions. Methods: Phase 1: Four LLM cha...

Bibliographic Details
Main Authors:	Yi-Lin Wang, Li-Chao Tian, Jing-Yuan Meng, Jie-Chao Zhang, Zhi-Xing Nie, Wen-Rui Wei, Dao-fang Ding, Xiao-Ye Tang, Qian Zhang, Yong He
Format:	Article
Language:	English
Published:	BMC 2025-08-01
Series:	BMC Medical Informatics and Decision Making
Subjects:	Large language model; Patient education; Rotator cuff injury; Real world interview
Online Access:	https://doi.org/10.1186/s12911-025-03105-5

Similar Items