Multiple large language models versus experienced physicians in diagnosing challenging cases with gastrointestinal symptoms

Abstract Faced with challenging cases, doctors are increasingly seeking diagnostic advice from large language models (LLMs). This study aims to compare the ability of LLMs and human physicians to diagnose challenging cases. An offline dataset of 67 challenging cases with primary gastrointestinal sym...

Bibliographic Details
Main Authors: Xintian Yang, Tongxin Li, Han Wang, Rongchun Zhang, Zhi Ni, Na Liu, Huihong Zhai, Jianghai Zhao, Fandong Meng, Zhongyin Zhou, Shanhong Tang, Limei Wang, Xiangping Wang, Hui Luo, Gui Ren, Linhui Zhang, Xiaoyu Kang, Jun Wang, Ning Bo, Xiaoning Yang, Weijie Xue, Xiaoyin Zhang, Ning Chen, Rui Guo, Baiwen Li, Yajun Li, Yaling Liu, Tiantian Zhang, Shuhui Liang, Yong Lv, Yongzhan Nie, Daiming Fan, Lina Zhao, Yanglin Pan
Format: Article
Language: English
Published: Nature Portfolio 2025-02-01
Series: npj Digital Medicine
Online Access: https://doi.org/10.1038/s41746-025-01486-5
collection DOAJ
description Abstract Faced with challenging cases, doctors are increasingly seeking diagnostic advice from large language models (LLMs). This study aims to compare the ability of LLMs and human physicians to diagnose challenging cases. An offline dataset of 67 challenging cases with primary gastrointestinal symptoms was used to solicit possible diagnoses from seven LLMs and 22 gastroenterologists. The diagnoses by Claude 3.5 Sonnet covered the highest proportion (95% confidence interval [CI]) of instructive diagnoses (76.1% [70.6%–80.9%]), significantly surpassing all the gastroenterologists (p < 0.05 for all). Claude 3.5 Sonnet achieved a significantly higher coverage rate (95% CI) than that of the gastroenterologists using search engines or other traditional resources (76.1% [70.6%–80.9%] vs. 45.5% [40.7%–50.4%], p < 0.001). The study highlights that advanced LLMs may assist gastroenterologists with instructive, time-saving, and cost-effective diagnostic scopes in challenging cases.
format Article
id doaj-art-32978c9a4f39481bacda086566ca7d43
institution Kabale University
issn 2398-6352
language English
publishDate 2025-02-01
publisher Nature Portfolio
record_format Article
series npj Digital Medicine
spelling doaj-art-32978c9a4f39481bacda086566ca7d43
2025-02-09T12:55:36Z
eng
Nature Portfolio
npj Digital Medicine
2398-6352
2025-02-01
81112
10.1038/s41746-025-01486-5
Multiple large language models versus experienced physicians in diagnosing challenging cases with gastrointestinal symptoms
Xintian Yang [0]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Tongxin Li [1]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Han Wang [2]: Department of Pathology, Union Hospital, Tongji Medical College, Huazhong University of Science and Technology
Rongchun Zhang [3]: Department of Gastroenterology, Xiamen Humanity Hospital, Fujian Medical University
Zhi Ni [4]: Department of Gastroenterology, Xiamen Humanity Hospital, Fujian Medical University
Na Liu [5]: Department of Gastroenterology, Hainan General Hospital (Hainan Affiliated Hospital of Hainan Medical University)
Huihong Zhai [6]: Department of Gastroenterology, Xuanwu Hospital, Capital Medical University
Jianghai Zhao [7]: Department of Gastroenterology, Huaihe Hospital of Henan University
Fandong Meng [8]: Department of Gastroenterology, Beijing Friendship Hospital, Capital Medical University, National Clinical Research Center for Digestive Disease, Beijing Digestive Disease Center, Beijing Key Laboratory for Precancerous Lesion of Digestive Disease
Zhongyin Zhou [9]: Department of Gastroenterology, Renmin Hospital of Wuhan University
Shanhong Tang [10]: Department of Gastroenterology, The General Hospital of Western Theater Command
Limei Wang [11]: Department of Gastroenterology, Shaanxi Second Provincial People’s Hospital
Xiangping Wang [12]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Hui Luo [13]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Gui Ren [14]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Linhui Zhang [15]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Xiaoyu Kang [16]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Jun Wang [17]: Department of Gastroenterology, The 986th Hospital of Xijing Hospital, Air Force Military Medical University
Ning Bo [18]: Department of Gastroenterology, The Second Affiliated Hospital of Chongqing Medical University
Xiaoning Yang [19]: Department of Gastroenterology, Zhongshan Hospital of Xiamen University, School of Medicine, Xiamen University
Weijie Xue [20]: Department of Transplantation and Pediatric Surgery, Kumamoto University Hospital
Xiaoyin Zhang [21]: Department of Gastroenterology, National Clinical Research Center of Infectious Disease, The Third People’s Hospital of Shenzhen, The Second Affiliated Hospital of Southern University of Science and Technology
Ning Chen [22]: Department of Gastroenterology, Peking University People’s Hospital, Peking University
Rui Guo [23]: Department of Gastroenterology, Beijing Shijingshan Hospital, Capital Medical University
Baiwen Li [24]: Department of Gastroenterology, Shanghai General Hospital, Shanghai Jiao Tong University School of Medicine
Yajun Li [25]: Department of Gastroenterology, General Hospital of Ningxia Medical University
Yaling Liu [26]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Tiantian Zhang [27]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Shuhui Liang [28]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Yong Lv [29]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Yongzhan Nie [30]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Daiming Fan [31]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
Lina Zhao [32]: Department of Radiotherapy, Xijing Hospital, Fourth Military Medical University
Yanglin Pan [33]: State Key Laboratory of Holistic Integrative Management of Gastrointestinal Cancers and National Clinical Research Center for Digestive Diseases, Xijing Hospital of Digestive Diseases, Fourth Military Medical University
title Multiple large language models versus experienced physicians in diagnosing challenging cases with gastrointestinal symptoms
url https://doi.org/10.1038/s41746-025-01486-5