Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer

Abstract Breast cancer is the most common type of cancer in women, and while current treatments can cure the majority of early-stage primary BC cases, recurrence remains a significant challenge. Traditional methods of assessing patient prognosis, such as AJCC, TNM staging, and biochemical markers, a...

Full description

Saved in:
Bibliographic Details
Main Authors: Junyi Li, Shixin Li, Dongpo Zhang, Yibing Zhu, Yue Wang, Xiaoxiao Xing, Juefei Mo, Yong Zhang, Daixiang Liao, Jun Li
Format: Article
Language:English
Published: Nature Portfolio 2025-06-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-03423-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849434352181575680
author Junyi Li
Shixin Li
Dongpo Zhang
Yibing Zhu
Yue Wang
Xiaoxiao Xing
Juefei Mo
Yong Zhang
Daixiang Liao
Jun Li
author_facet Junyi Li
Shixin Li
Dongpo Zhang
Yibing Zhu
Yue Wang
Xiaoxiao Xing
Juefei Mo
Yong Zhang
Daixiang Liao
Jun Li
author_sort Junyi Li
collection DOAJ
description Abstract Breast cancer is the most common type of cancer in women, and while current treatments can cure the majority of early-stage primary BC cases, recurrence remains a significant challenge. Traditional methods of assessing patient prognosis, such as AJCC, TNM staging, and biochemical markers, are no longer sufficient in the era of precision medicine. Existing tumor models often rely on single selection and simpler algorithms, which can lead to poor effectiveness or overfitting. To address these limitations, this study systematically analyzed RNA-seq high-throughput data and combined 10 machine learning algorithms to construct 117 models. The optimal algorithm combination, StepCox[both] and ridge regression, was identified, and an immune-related gene signature (IRGS) composed of 12 genes was developed. The IRGS demonstrated outstanding predictive performance across multiple datasets and surpassed 10 previously published signatures. GSEA analysis revealed significant enrichment differences in cellular processes, diseases, and immune-related pathways between high- and low-risk recurrence patients. The low recurrence risk group based on IRGS exhibited a stronger immune phenotype and better survival prognosis, which may be associated with higher infiltration of CD4 + and CD8 + T cells. However, high M2 macrophage infiltration suggests potential immune escape in low recurrence risk patients. Combined with immune checkpoint expression levels and TIDE results, it is suggested that low-risk patients may respond positively to immunotherapy. Through drug sensitivity analysis, potential drugs that are more effective for both high- and low-risk groups have been identified. Therefore, the IRGS developed in this study can serve as an adjunct tool for assessing the recurrence risk of breast cancer, potentially enhancing personalized treatment planning, and improving the clinical management of patients with breast cancer.
format Article
id doaj-art-bdd785d2dd8b457aab17e9a4ad08239d
institution Kabale University
issn 2045-2322
language English
publishDate 2025-06-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-bdd785d2dd8b457aab17e9a4ad08239d2025-08-20T03:26:42ZengNature PortfolioScientific Reports2045-23222025-06-0115111610.1038/s41598-025-03423-8Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancerJunyi Li0Shixin Li1Dongpo Zhang2Yibing Zhu3Yue Wang4Xiaoxiao Xing5Juefei Mo6Yong Zhang7Daixiang Liao8Jun Li9Department of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Oncology, The First Affiliated Hospital of Guizhou University of Traditional Chinese MedicineDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesDepartment of Surgery, Guang ’anmen Hospital, China Academy of Chinese Medical SciencesAbstract Breast cancer is the most common type of cancer in women, and while current treatments can cure the majority of early-stage primary BC cases, recurrence remains a significant challenge. Traditional methods of assessing patient prognosis, such as AJCC, TNM staging, and biochemical markers, are no longer sufficient in the era of precision medicine. Existing tumor models often rely on single selection and simpler algorithms, which can lead to poor effectiveness or overfitting. To address these limitations, this study systematically analyzed RNA-seq high-throughput data and combined 10 machine learning algorithms to construct 117 models. The optimal algorithm combination, StepCox[both] and ridge regression, was identified, and an immune-related gene signature (IRGS) composed of 12 genes was developed. The IRGS demonstrated outstanding predictive performance across multiple datasets and surpassed 10 previously published signatures. GSEA analysis revealed significant enrichment differences in cellular processes, diseases, and immune-related pathways between high- and low-risk recurrence patients. The low recurrence risk group based on IRGS exhibited a stronger immune phenotype and better survival prognosis, which may be associated with higher infiltration of CD4 + and CD8 + T cells. However, high M2 macrophage infiltration suggests potential immune escape in low recurrence risk patients. Combined with immune checkpoint expression levels and TIDE results, it is suggested that low-risk patients may respond positively to immunotherapy. Through drug sensitivity analysis, potential drugs that are more effective for both high- and low-risk groups have been identified. Therefore, the IRGS developed in this study can serve as an adjunct tool for assessing the recurrence risk of breast cancer, potentially enhancing personalized treatment planning, and improving the clinical management of patients with breast cancer.https://doi.org/10.1038/s41598-025-03423-8Data miningPrognosisImmune microenvironmentExpression differenceFunctional enrichment analysisMachine learning
spellingShingle Junyi Li
Shixin Li
Dongpo Zhang
Yibing Zhu
Yue Wang
Xiaoxiao Xing
Juefei Mo
Yong Zhang
Daixiang Liao
Jun Li
Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer
Scientific Reports
Data mining
Prognosis
Immune microenvironment
Expression difference
Functional enrichment analysis
Machine learning
title Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer
title_full Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer
title_fullStr Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer
title_full_unstemmed Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer
title_short Machine learning-based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer
title_sort machine learning based integration develops relapse related signature for predicting prognosis and indicating immune microenvironment infiltration in breast cancer
topic Data mining
Prognosis
Immune microenvironment
Expression difference
Functional enrichment analysis
Machine learning
url https://doi.org/10.1038/s41598-025-03423-8
work_keys_str_mv AT junyili machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT shixinli machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT dongpozhang machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT yibingzhu machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT yuewang machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT xiaoxiaoxing machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT juefeimo machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT yongzhang machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT daixiangliao machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer
AT junli machinelearningbasedintegrationdevelopsrelapserelatedsignatureforpredictingprognosisandindicatingimmunemicroenvironmentinfiltrationinbreastcancer