Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model

Abstract Background Radiotherapy treatment planning traditionally involves complex and time-consuming processes, often relying on trial-and-error methods. The emergence of artificial intelligence, particularly Large Language Models (LLMs), surpassing human capabilities and existing algorithms in var...

Full description

Saved in:

Bibliographic Details
Main Authors:	Shuoyang Wei, Ankang Hu, Yongguang Liang, Jingru Yang, Lang Yu, Wenbo Li, Bo Yang, Jie Qiu
Format:	Article
Language:	English
Published:	BMC 2025-05-01
Series:	Radiation Oncology
Subjects:	Radiotherapy Treatment planning Large language model Automation
Online Access:	https://doi.org/10.1186/s13014-025-02660-5
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850154934515793920
author	Shuoyang Wei Ankang Hu Yongguang Liang Jingru Yang Lang Yu Wenbo Li Bo Yang Jie Qiu
author_facet	Shuoyang Wei Ankang Hu Yongguang Liang Jingru Yang Lang Yu Wenbo Li Bo Yang Jie Qiu
author_sort	Shuoyang Wei
collection	DOAJ
description	Abstract Background Radiotherapy treatment planning traditionally involves complex and time-consuming processes, often relying on trial-and-error methods. The emergence of artificial intelligence, particularly Large Language Models (LLMs), surpassing human capabilities and existing algorithms in various domains, presents an opportunity to automate and enhance this optimization process. Purpose This study seeks to evaluate the capacity of LLMs to generate radiotherapy treatment plans comparable to those crafted by human medical physicists, focusing on target volume conformity and organs-at-risk (OARs) dose sparing. The goal is to automate the optimization process of radiotherapy treatment plans through the utilization of LLMs. Methods Multiple LLMs were employed to adjust optimization parameters for radiotherapy treatment plans, using a dataset comprising 35 cervical cancer patients treated with volumetric modulated arc therapy (VMAT). Customized prompts were applied to 5 patients to tailor the LLMs, which were subsequently tested on 30 patients. Evaluation metrics included target volume conformity, dose homogeneity, monitor units (MU) value, and OARs dose sparing, comparing plans generated by various LLMs to manual plans. Results With the exception of Gemini-1.5-flash, which faced challenges due to hallucinations, Qwen-2.5-max and Llama-3.2 produced acceptable VMAT plans in 16.3 ± 5.0 and 9.8 ± 2.1 min, respectively, outperforming an experienced human physicist’s time cost of about 20 min. The average conformity index (CI) for Qwen-2.5-max plans, Llama-3.2 plans, and manual plans on the test set were 0.929 ± 0.007, 0.928 ± 0.007, and 0.926 ± 0.007, respectively. The average homogeneity index (HI) was 0.058 ± 0.006, 0.059 ± 0.005, and 0.065 ± 0.006, respectively. While there was a significant difference in target volume conformity between LLM plans and manual plans, OARs dose sparing showed no significant variations. In lateral comparisons among different LLMs, no statistically significant differences were observed in the PTV dose, OARs dose sparing, and target volume conformity between Qwen-2.5-max and Llama-3.2 plans. Conclusions Through an assessment of LLM-generated plans and clinical plans in terms of target volume conformity and OARs dose sparing, this study provides preliminary evidence supporting the viability of LLMs for optimizing radiotherapy treatment plans. The implementation of LLMs demonstrates the potential for enhancing clinical workflows and reducing the workload associated with treatment planning.
format	Article
id	doaj-art-6d391e736995418eba0e87a4e7368d9c
institution	OA Journals
issn	1748-717X
language	English
publishDate	2025-05-01
publisher	BMC
record_format	Article
series	Radiation Oncology
spelling	doaj-art-6d391e736995418eba0e87a4e7368d9c2025-08-20T02:25:08ZengBMCRadiation Oncology1748-717X2025-05-0120111310.1186/s13014-025-02660-5Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language modelShuoyang Wei0Ankang Hu1Yongguang Liang2Jingru Yang3Lang Yu4Wenbo Li5Bo Yang6Jie Qiu7Department of Radiotherapy, Peking Union Medical College HospitalDepartment of Engineering Physics, Tsinghua UniversityDepartment of Radiotherapy, Peking Union Medical College HospitalDepartment of Radiotherapy, Peking Union Medical College HospitalDepartment of Radiotherapy, Peking Union Medical College HospitalDepartment of Radiotherapy, Peking Union Medical College HospitalDepartment of Radiotherapy, Peking Union Medical College HospitalDepartment of Radiotherapy, Peking Union Medical College HospitalAbstract Background Radiotherapy treatment planning traditionally involves complex and time-consuming processes, often relying on trial-and-error methods. The emergence of artificial intelligence, particularly Large Language Models (LLMs), surpassing human capabilities and existing algorithms in various domains, presents an opportunity to automate and enhance this optimization process. Purpose This study seeks to evaluate the capacity of LLMs to generate radiotherapy treatment plans comparable to those crafted by human medical physicists, focusing on target volume conformity and organs-at-risk (OARs) dose sparing. The goal is to automate the optimization process of radiotherapy treatment plans through the utilization of LLMs. Methods Multiple LLMs were employed to adjust optimization parameters for radiotherapy treatment plans, using a dataset comprising 35 cervical cancer patients treated with volumetric modulated arc therapy (VMAT). Customized prompts were applied to 5 patients to tailor the LLMs, which were subsequently tested on 30 patients. Evaluation metrics included target volume conformity, dose homogeneity, monitor units (MU) value, and OARs dose sparing, comparing plans generated by various LLMs to manual plans. Results With the exception of Gemini-1.5-flash, which faced challenges due to hallucinations, Qwen-2.5-max and Llama-3.2 produced acceptable VMAT plans in 16.3 ± 5.0 and 9.8 ± 2.1 min, respectively, outperforming an experienced human physicist’s time cost of about 20 min. The average conformity index (CI) for Qwen-2.5-max plans, Llama-3.2 plans, and manual plans on the test set were 0.929 ± 0.007, 0.928 ± 0.007, and 0.926 ± 0.007, respectively. The average homogeneity index (HI) was 0.058 ± 0.006, 0.059 ± 0.005, and 0.065 ± 0.006, respectively. While there was a significant difference in target volume conformity between LLM plans and manual plans, OARs dose sparing showed no significant variations. In lateral comparisons among different LLMs, no statistically significant differences were observed in the PTV dose, OARs dose sparing, and target volume conformity between Qwen-2.5-max and Llama-3.2 plans. Conclusions Through an assessment of LLM-generated plans and clinical plans in terms of target volume conformity and OARs dose sparing, this study provides preliminary evidence supporting the viability of LLMs for optimizing radiotherapy treatment plans. The implementation of LLMs demonstrates the potential for enhancing clinical workflows and reducing the workload associated with treatment planning.https://doi.org/10.1186/s13014-025-02660-5RadiotherapyTreatment planningLarge language modelAutomation
spellingShingle	Shuoyang Wei Ankang Hu Yongguang Liang Jingru Yang Lang Yu Wenbo Li Bo Yang Jie Qiu Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model Radiation Oncology Radiotherapy Treatment planning Large language model Automation
title	Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model
title_full	Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model
title_fullStr	Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model
title_full_unstemmed	Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model
title_short	Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model
title_sort	feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model
topic	Radiotherapy Treatment planning Large language model Automation
url	https://doi.org/10.1186/s13014-025-02660-5
work_keys_str_mv	AT shuoyangwei feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel AT ankanghu feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel AT yongguangliang feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel AT jingruyang feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel AT langyu feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel AT wenboli feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel AT boyang feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel AT jieqiu feasibilitystudyofautomaticradiotherapytreatmentplanningforcervicalcancerusingalargelanguagemodel

Feasibility study of automatic radiotherapy treatment planning for cervical cancer using a large language model

Similar Items