Large language model integrations in cancer decision-making: a systematic review and meta-analysis

Bibliographic Details
Main Authors: Yuexing Hao, Zhiwen Qiu, Jason Holmes, Corinna E. Löckenhoff, Wei Liu, Marzyeh Ghassemi, Saleh Kalantari
Format: Article
Language: English
Published: Nature Portfolio 2025-07-01
Series: npj Digital Medicine
Online Access: https://doi.org/10.1038/s41746-025-01824-7
Description
Summary: Large Language Models (LLMs) are increasingly used to support cancer patients and clinicians in decision-making. This systematic review investigates how LLMs are integrated into oncology and evaluated by researchers. We conducted a comprehensive search across PubMed, Web of Science, Scopus, and the ACM Digital Library through May 2024, identifying 56 studies covering 15 cancer types. The meta-analysis results suggested that LLMs were commonly used to summarize, translate, and communicate clinical information, but performance varied: the average overall accuracy was 76.2%, with average diagnostic accuracy lower at 67.4%, revealing gaps in the clinical readiness of this technology. Most evaluations relied heavily on quantitative datasets and automated methods without human graders, emphasizing “accuracy” and “appropriateness” while rarely addressing “safety”, “harm”, or “clarity”. Current limitations for LLMs in cancer decision-making, such as limited domain knowledge and dependence on human oversight, demonstrate the need for open datasets and standardized evaluations to improve reliability.
ISSN: 2398-6352