Ensemble Machine Learning Classifiers Combining CT Radiomics and Clinical-Radiological Features for Preoperative Prediction of Pathological Invasiveness in Lung Adenocarcinoma Presenting as Part-Solid Nodules: A Multicenter Retrospective Study

Background Lung adenocarcinomas manifesting as part-solid nodules (PSNs) represent a distinct clinical subtype where accurate preoperative determination of pathological invasiveness critically influences both prognosis and surgical decision-making. This multicenter study aims to develop an ensemble...

Full description

Saved in:
Bibliographic Details
Main Authors: Yunhua Li BS, Jianbang Ding MS, Kun Wu MS, Wanyin Qi BS, Shanyue Lin BS, Gangwen Chen BS, Zhichao Zuo PhD
Format: Article
Language:English
Published: SAGE Publishing 2025-06-01
Series:Technology in Cancer Research & Treatment
Online Access:https://doi.org/10.1177/15330338251351365
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Background Lung adenocarcinomas manifesting as part-solid nodules (PSNs) represent a distinct clinical subtype where accurate preoperative determination of pathological invasiveness critically influences both prognosis and surgical decision-making. This multicenter study aims to develop an ensemble machine learning classifier that integrates computed tomography (CT) radiomic signatures with clinical-radiological features to enhance the preoperative prediction of invasive status. Methods We retrospectively analyzed 344 patients with pathologically confirmed lung adenocarcinoma presenting as PSNs across three medical centers. Following random allocation into training (n = 240) and validation (n = 104) sets (7:3 ratio), we extracted 1239 quantitative radiomic features from preoperative thin-section CT scans. Through rigorous feature engineering, we constructed a radiomic score using least absolute shrinkage and selection operator regression. We systematically evaluated both single-algorithm classifiers and ensemble approaches (including hard/soft voting and stacking), incorporating both the radiomic score and clinical-radiological features. Results Among the various evaluated machine learning models, the stacking classifier, which combines radiomic scores and clinical-radiological features, performed the best, achieving an AUC of 0.84, an accuracy of 0.817, an F1 score of 0.869, a precision of 0.818, and a recall of 0.926. Conclusion Our stacking ensemble learning classifier, which synergistically combines CT radiomics signatures with clinical-radiological features, provides a clinically actionable tool for the preoperative prediction of pathological invasiveness in PSN-type lung adenocarcinoma, thereby enhancing individualized surgical planning.
ISSN:1533-0338