Impact of large language models and vision deep learning models in predicting neoadjuvant rectal score for rectal cancer treated with neoadjuvant chemoradiation

Abstract This study aims to explore Deep Learning methods, namely Large Language Models (LLMs) and Computer Vision models to accurately predict neoadjuvant rectal (NAR) score for locally advanced rectal cancer (LARC) treated with neoadjuvant chemoradiation (NACRT). The NAR score is a validated surro...

Full description

Saved in:
Bibliographic Details
Main Authors: Hyun Bin Kim, Hong Qi Tan, Wen Long Nei, Ying Cong Ryan Shea Tan, Yiyu Cai, Fuqiang Wang
Format: Article
Language:English
Published: BMC 2025-07-01
Series:BMC Medical Imaging
Subjects:
Online Access:https://doi.org/10.1186/s12880-025-01844-5
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract This study aims to explore Deep Learning methods, namely Large Language Models (LLMs) and Computer Vision models to accurately predict neoadjuvant rectal (NAR) score for locally advanced rectal cancer (LARC) treated with neoadjuvant chemoradiation (NACRT). The NAR score is a validated surrogate endpoint for LARC. 160 CT scans of patients were used in this study, along with 4 different types of radiology reports, 2 generated from CT scans and other 2 from MRI scans, both before and after NACRT. For CT scans, two different approaches with convolutional neural network were utilized to tackle the 3D scan entirely or tackle it slice by slice. For radiology reports, an encoder architecture LLM was used. The performance of the approaches was quantified by the Area under the Receiver Operating Characteristic curve (AUC). The two different approaches for CT scans yielded $$\:0.537\:(\pm\:0.095)$$ and $$\:0.541\:(\pm\:0.072)$$ while the LLM trained on post NACRT MRI reports showed the most predictive potential at $$\:0.708\:(\pm\:0.041)$$ and a statistical improvement, p = 0.03, over the baseline clinical approach (from $$\:0.687\:(\pm\:0.136)$$ to $$\:0.726\:(\pm\:0.106$$ )). This study showcases the potential of Large Language Models and the inadequacies of CT scans in predicting NAR values. Clinical trial number Not applicable.
ISSN:1471-2342