A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort

Abstract Deep learning (DL) methods have demonstrated remarkable effectiveness in assisting with lung cancer risk prediction tasks using computed tomography (CT) scans. However, the lack of comprehensive comparison and validation of state-of-the-art (SOTA) models in practical settings limits their c...

Full description

Saved in:

Bibliographic Details
Main Authors:	Yifan Jiang, Leyla Ebrahimpour, Philippe Després, Venkata SK. Manem
Format:	Article
Language:	English
Published:	Nature Portfolio 2025-01-01
Series:	Scientific Reports
Online Access:	https://doi.org/10.1038/s41598-024-84193-7
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1850099564416073728
author	Yifan Jiang Leyla Ebrahimpour Philippe Després Venkata SK. Manem
author_facet	Yifan Jiang Leyla Ebrahimpour Philippe Després Venkata SK. Manem
author_sort	Yifan Jiang
collection	DOAJ
description	Abstract Deep learning (DL) methods have demonstrated remarkable effectiveness in assisting with lung cancer risk prediction tasks using computed tomography (CT) scans. However, the lack of comprehensive comparison and validation of state-of-the-art (SOTA) models in practical settings limits their clinical application. This study aims to review and analyze current SOTA deep learning models for lung cancer risk prediction (malignant-benign classification). To evaluate our model’s general performance, we selected 253 out of 467 patients from a subset of the National Lung Screening Trial (NLST) who had CT scans without contrast, which are the most commonly used, and divided them into training and test cohorts. The CT scans were preprocessed into 2D-image and 3D-volume formats according to their nodule annotations. We evaluated ten 3D and eleven 2D SOTA deep learning models, which were pretrained on large-scale general-purpose datasets (Kinetics and ImageNet) and radiological datasets (3DSeg-8, nnUnet and RadImageNet), for their lung cancer risk prediction performance. Our results showed that 3D-based deep learning models generally perform better than 2D models. On the test cohort, the best-performing 3D model achieved an AUROC of 0.86, while the best 2D model reached 0.79. The lowest AUROCs for the 3D and 2D models were 0.70 and 0.62, respectively. Furthermore, pretraining on large-scale radiological image datasets did not show the expected performance advantage over pretraining on general-purpose datasets. Both 2D and 3D deep learning models can handle lung cancer risk prediction tasks effectively, although 3D models generally have superior performance than their 2D competitors. Our findings highlight the importance of carefully selecting pretrained datasets and model architectures for lung cancer risk prediction. Overall, these results have important implications for the development and clinical integration of DL-based tools in lung cancer screening.
format	Article
id	doaj-art-77fbd92f0b874fc9a5beaa897cdded6b
institution	DOAJ
issn	2045-2322
language	English
publishDate	2025-01-01
publisher	Nature Portfolio
record_format	Article
series	Scientific Reports
spelling	doaj-art-77fbd92f0b874fc9a5beaa897cdded6b2025-08-20T02:40:28ZengNature PortfolioScientific Reports2045-23222025-01-0115111210.1038/s41598-024-84193-7A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohortYifan Jiang0Leyla Ebrahimpour1Philippe Després2Venkata SK. Manem3Centre de recherche du CHU de Québec-Université LavalCentre de recherche du CHU de Québec-Université LavalDépartement de physique, de génie physique et d’optique, Université LavalCentre de recherche du CHU de Québec-Université LavalAbstract Deep learning (DL) methods have demonstrated remarkable effectiveness in assisting with lung cancer risk prediction tasks using computed tomography (CT) scans. However, the lack of comprehensive comparison and validation of state-of-the-art (SOTA) models in practical settings limits their clinical application. This study aims to review and analyze current SOTA deep learning models for lung cancer risk prediction (malignant-benign classification). To evaluate our model’s general performance, we selected 253 out of 467 patients from a subset of the National Lung Screening Trial (NLST) who had CT scans without contrast, which are the most commonly used, and divided them into training and test cohorts. The CT scans were preprocessed into 2D-image and 3D-volume formats according to their nodule annotations. We evaluated ten 3D and eleven 2D SOTA deep learning models, which were pretrained on large-scale general-purpose datasets (Kinetics and ImageNet) and radiological datasets (3DSeg-8, nnUnet and RadImageNet), for their lung cancer risk prediction performance. Our results showed that 3D-based deep learning models generally perform better than 2D models. On the test cohort, the best-performing 3D model achieved an AUROC of 0.86, while the best 2D model reached 0.79. The lowest AUROCs for the 3D and 2D models were 0.70 and 0.62, respectively. Furthermore, pretraining on large-scale radiological image datasets did not show the expected performance advantage over pretraining on general-purpose datasets. Both 2D and 3D deep learning models can handle lung cancer risk prediction tasks effectively, although 3D models generally have superior performance than their 2D competitors. Our findings highlight the importance of carefully selecting pretrained datasets and model architectures for lung cancer risk prediction. Overall, these results have important implications for the development and clinical integration of DL-based tools in lung cancer screening.https://doi.org/10.1038/s41598-024-84193-7
spellingShingle	Yifan Jiang Leyla Ebrahimpour Philippe Després Venkata SK. Manem A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort Scientific Reports
title	A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort
title_full	A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort
title_fullStr	A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort
title_full_unstemmed	A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort
title_short	A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort
title_sort	benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort
url	https://doi.org/10.1038/s41598-024-84193-7
work_keys_str_mv	AT yifanjiang abenchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort AT leylaebrahimpour abenchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort AT philippedespres abenchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort AT venkataskmanem abenchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort AT yifanjiang benchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort AT leylaebrahimpour benchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort AT philippedespres benchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort AT venkataskmanem benchmarkofdeeplearningapproachestopredictlungcancerriskusingnationallungscreeningtrialcohort

A benchmark of deep learning approaches to predict lung cancer risk using national lung screening trial cohort

Similar Items