Comparing ChatGPT-3.5 and ChatGPT-4’s alignments with the German evidence-based S3 guideline for adult soft tissue sarcoma
Summary: Clinical reliability assessment of large language models is necessary due to their increasing use in healthcare. This study assessed the performance of ChatGPT-3.5 and ChatGPT-4 in answering questions derived from the German evidence-based S3 guideline for adult soft tissue sarcoma (STS)....
| Main Authors: | Cheng-Peng Li, Jens Jakob, Franka Menge, Christoph Reißfelder, Peter Hohenberger, Cui Yang |
|---|---|
| Format: | Article |
| Language: | English |
| Published: | Elsevier, 2024-12-01 |
| Series: | iScience |
| Online Access: | http://www.sciencedirect.com/science/article/pii/S2589004224027202 |
Similar Items
- Comparative analysis of ChatGPT 3.5 and ChatGPT 4 obstetric and gynecological knowledge
  by: Franciszek Ługowski, et al.
  Published: (2025-07-01)
- Performance of ChatGPT-3.5 and ChatGPT-4 in the Taiwan National Pharmacist Licensing Examination: Comparative Evaluation Study
  by: Ying-Mei Wang, et al.
  Published: (2025-01-01)
- Performance of ChatGPT-3.5 and ChatGPT-4 in the field of specialist medical knowledge on National Specialization Exam in neurosurgery
  by: Maciej Laskowski, et al.
  Published: (2024-10-01)
- Accuracy, appropriateness, and readability of ChatGPT-4 and ChatGPT-3.5 in answering pediatric emergency medicine post-discharge questions
  by: Mitul Gupta, et al.
  Published: (2025-04-01)
- ChatGPT Conversations on Oral Cancer: Unveiling ChatGPT's Potential and Pitfalls
  by: Nikunj Maniyar, et al.
  Published: (2024-06-01)