Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study
Abstract BackgroundThe disease burden of breast cancer is increasing in China. Guiding people to obtain accurate information on breast cancer and improving the public’s health literacy are crucial for the early detection and timely treatment of breast cancer. Large language mo...
Saved in:
| Main Authors: | , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
JMIR Publications
2025-05-01
|
| Series: | JMIR Medical Informatics |
| Online Access: | https://medinform.jmir.org/2025/1/e66429 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849696844000526336 |
|---|---|
| author | Hui Liu Jialun Peng Lu Li Ao Deng XiangXin Huang Guobing Yin Haojun Luo |
| author_facet | Hui Liu Jialun Peng Lu Li Ao Deng XiangXin Huang Guobing Yin Haojun Luo |
| author_sort | Hui Liu |
| collection | DOAJ |
| description |
Abstract
BackgroundThe disease burden of breast cancer is increasing in China. Guiding people to obtain accurate information on breast cancer and improving the public’s health literacy are crucial for the early detection and timely treatment of breast cancer. Large language model (LLM) is a currently popular source of health information. However, the accuracy and practicality of the breast cancer–related information provided by LLMs have not yet been evaluated.
ObjectiveThis study aims to evaluate and compare the accuracy, practicality, and generalization-specificity of responses to breast cancer–related questions from two LLMs, ChatGPT and ERNIE Bot (EB).
MethodsThe questions asked to the LLMs consisted of a patient questionnaire and an expert questionnaire, each containing 15 questions. ChatGPT was queried in both Chinese and English, recorded as ChatGPT-Chinese (ChatGPT-C) and ChatGPT-English (ChatGPT-E) respectively, while EB was queried in Chinese. The accuracy, practicality, and generalization-specificity of each inquiry’s responses were rated by a breast cancer multidisciplinary treatment team using Likert scales.
ResultsOverall, for both the patient and expert questionnaire, the accuracy and practicality of responses from ChatGPT-E were significantly higher than those from ChatGPT-C and EB (all Ps
ConclusionsCurrently, compared to other LLMs, ChatGPT-E has demonstrated greater potential for application in educating Chinese patients with breast cancer, and may serve as an effective tool for them to obtain health information. However, for breast cancer specialists, these LLMs are not yet suitable for assisting in clinical diagnosis or treatment activities. Additionally, data security, ethical, and legal risks associated with using LLMs in clinical practice cannot be ignored. In the future, further research is needed to determine the true efficacy of LLMs in clinical scenarios related to breast cancer in China. |
| format | Article |
| id | doaj-art-25cd0556b2d645fb822b843a66cce3ca |
| institution | DOAJ |
| issn | 2291-9694 |
| language | English |
| publishDate | 2025-05-01 |
| publisher | JMIR Publications |
| record_format | Article |
| series | JMIR Medical Informatics |
| spelling | doaj-art-25cd0556b2d645fb822b843a66cce3ca2025-08-20T03:19:20ZengJMIR PublicationsJMIR Medical Informatics2291-96942025-05-0113e66429e6642910.2196/66429Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire StudyHui Liuhttp://orcid.org/0009-0006-9179-488XJialun Penghttp://orcid.org/0009-0003-9075-0154Lu Lihttp://orcid.org/0009-0009-8381-3093Ao Denghttp://orcid.org/0009-0006-2131-5484XiangXin Huanghttp://orcid.org/0009-0003-4019-8955Guobing Yinhttp://orcid.org/0009-0009-5939-9531Haojun Luohttp://orcid.org/0000-0002-6860-0251 Abstract BackgroundThe disease burden of breast cancer is increasing in China. Guiding people to obtain accurate information on breast cancer and improving the public’s health literacy are crucial for the early detection and timely treatment of breast cancer. Large language model (LLM) is a currently popular source of health information. However, the accuracy and practicality of the breast cancer–related information provided by LLMs have not yet been evaluated. ObjectiveThis study aims to evaluate and compare the accuracy, practicality, and generalization-specificity of responses to breast cancer–related questions from two LLMs, ChatGPT and ERNIE Bot (EB). MethodsThe questions asked to the LLMs consisted of a patient questionnaire and an expert questionnaire, each containing 15 questions. ChatGPT was queried in both Chinese and English, recorded as ChatGPT-Chinese (ChatGPT-C) and ChatGPT-English (ChatGPT-E) respectively, while EB was queried in Chinese. The accuracy, practicality, and generalization-specificity of each inquiry’s responses were rated by a breast cancer multidisciplinary treatment team using Likert scales. ResultsOverall, for both the patient and expert questionnaire, the accuracy and practicality of responses from ChatGPT-E were significantly higher than those from ChatGPT-C and EB (all Ps ConclusionsCurrently, compared to other LLMs, ChatGPT-E has demonstrated greater potential for application in educating Chinese patients with breast cancer, and may serve as an effective tool for them to obtain health information. However, for breast cancer specialists, these LLMs are not yet suitable for assisting in clinical diagnosis or treatment activities. Additionally, data security, ethical, and legal risks associated with using LLMs in clinical practice cannot be ignored. In the future, further research is needed to determine the true efficacy of LLMs in clinical scenarios related to breast cancer in China.https://medinform.jmir.org/2025/1/e66429 |
| spellingShingle | Hui Liu Jialun Peng Lu Li Ao Deng XiangXin Huang Guobing Yin Haojun Luo Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study JMIR Medical Informatics |
| title | Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study |
| title_full | Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study |
| title_fullStr | Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study |
| title_full_unstemmed | Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study |
| title_short | Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study |
| title_sort | large language models as a consulting hotline for patients with breast cancer and specialists in china cross sectional questionnaire study |
| url | https://medinform.jmir.org/2025/1/e66429 |
| work_keys_str_mv | AT huiliu largelanguagemodelsasaconsultinghotlineforpatientswithbreastcancerandspecialistsinchinacrosssectionalquestionnairestudy AT jialunpeng largelanguagemodelsasaconsultinghotlineforpatientswithbreastcancerandspecialistsinchinacrosssectionalquestionnairestudy AT luli largelanguagemodelsasaconsultinghotlineforpatientswithbreastcancerandspecialistsinchinacrosssectionalquestionnairestudy AT aodeng largelanguagemodelsasaconsultinghotlineforpatientswithbreastcancerandspecialistsinchinacrosssectionalquestionnairestudy AT xiangxinhuang largelanguagemodelsasaconsultinghotlineforpatientswithbreastcancerandspecialistsinchinacrosssectionalquestionnairestudy AT guobingyin largelanguagemodelsasaconsultinghotlineforpatientswithbreastcancerandspecialistsinchinacrosssectionalquestionnairestudy AT haojunluo largelanguagemodelsasaconsultinghotlineforpatientswithbreastcancerandspecialistsinchinacrosssectionalquestionnairestudy |