Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study

Abstract BackgroundThe disease burden of breast cancer is increasing in China. Guiding people to obtain accurate information on breast cancer and improving the public’s health literacy are crucial for the early detection and timely treatment of breast cancer. Large language mo...

Full description

Saved in:
Bibliographic Details
Main Authors: Hui Liu, Jialun Peng, Lu Li, Ao Deng, XiangXin Huang, Guobing Yin, Haojun Luo
Format: Article
Language:English
Published: JMIR Publications 2025-05-01
Series:JMIR Medical Informatics
Online Access:https://medinform.jmir.org/2025/1/e66429
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract BackgroundThe disease burden of breast cancer is increasing in China. Guiding people to obtain accurate information on breast cancer and improving the public’s health literacy are crucial for the early detection and timely treatment of breast cancer. Large language model (LLM) is a currently popular source of health information. However, the accuracy and practicality of the breast cancer–related information provided by LLMs have not yet been evaluated. ObjectiveThis study aims to evaluate and compare the accuracy, practicality, and generalization-specificity of responses to breast cancer–related questions from two LLMs, ChatGPT and ERNIE Bot (EB). MethodsThe questions asked to the LLMs consisted of a patient questionnaire and an expert questionnaire, each containing 15 questions. ChatGPT was queried in both Chinese and English, recorded as ChatGPT-Chinese (ChatGPT-C) and ChatGPT-English (ChatGPT-E) respectively, while EB was queried in Chinese. The accuracy, practicality, and generalization-specificity of each inquiry’s responses were rated by a breast cancer multidisciplinary treatment team using Likert scales. ResultsOverall, for both the patient and expert questionnaire, the accuracy and practicality of responses from ChatGPT-E were significantly higher than those from ChatGPT-C and EB (all Ps ConclusionsCurrently, compared to other LLMs, ChatGPT-E has demonstrated greater potential for application in educating Chinese patients with breast cancer, and may serve as an effective tool for them to obtain health information. However, for breast cancer specialists, these LLMs are not yet suitable for assisting in clinical diagnosis or treatment activities. Additionally, data security, ethical, and legal risks associated with using LLMs in clinical practice cannot be ignored. In the future, further research is needed to determine the true efficacy of LLMs in clinical scenarios related to breast cancer in China.
ISSN:2291-9694