Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancer

Abstract Colorectal cancer (CRC) is the most common malignancy in the digestive system, with a lower 5-year overall survival rate. There is increasing evidence showing that RNA modification regulators such as m1A, m5C, m6A, and m7G play crucial roles in tumor progression. However, the prognostic rol...

Full description

Saved in:
Bibliographic Details
Main Authors: Hao Wei, Qingsong Luo, Weimin Zhong
Format: Article
Language:English
Published: Nature Portfolio 2025-02-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-025-89944-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849715829849980928
author Hao Wei
Qingsong Luo
Weimin Zhong
author_facet Hao Wei
Qingsong Luo
Weimin Zhong
author_sort Hao Wei
collection DOAJ
description Abstract Colorectal cancer (CRC) is the most common malignancy in the digestive system, with a lower 5-year overall survival rate. There is increasing evidence showing that RNA modification regulators such as m1A, m5C, m6A, and m7G play crucial roles in tumor progression. However, the prognostic role of integrated m6A/m5C/m1A/m7G methylation modifications in CRC has not been reported and requires further investigation. Five cohorts with 989 samples were first retrieved from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Then, Three m6A/m1A/m5C/m7G-associated molecular subtypes were identified in the TCGA cohort via the consensus clustering analysis, and 1710 co-expression module genes associated with subtypes were obtained from weighted gene co-expression network analysis (WGCNA) results. After conducting univariate Cox analysis in each cohort and retaining common genes, an RNA methylation-related signature (RMS) was developed through the combination of 101 algorithms. The RMS exhibited strong accuracy and robustness in predicting survival outcomes across distinct cohorts (TCGA, GSE17536, GSE17537, GSE29612, and GSE38832) and demonstrated good performance compared with previously reported risk signatures. Additionally, the RMS was identified as an independent prognostic factor for overall survival in the TCGA, GSE17536, GSE17537, GSE29612, and GSE38832 cohorts. The patients were then stratified into high and low-risk groups based on the median risk score across the five cohorts. Compared to the high-risk groups, the low-risk group showed an increased immune cell infiltration level and showed more benefit from immunotherapy and chemotherapy drugs. Moreover, six drugs (KU-0063794, temozolomide, DNMDP, ML162, SJ-172550, ML050) from the Cancer Therapeutics Response Portal (CTRP) and five drugs (BIBX-1382, lomitapide, ZLN005, PPT, panobinostat) from the PRSM database were identified for the high-risk group patients. By integrating data from the TCGA database and the Cancer Cell Line Encyclopedia (CCLE) database, a potential therapeutic target named TERT was identified for the high-risk group of patients. The single-cell results indicated that TERT was highly expressed in epithelial cells. Overall, our developed RMS can accurately predict patients survival outcomes and immunotherapy response, indicating promising application in clinical practice. These findings may offer guidance for the prognosis and personalized treatment of CRC.
format Article
id doaj-art-e7f618132b0e458f879e4954499fd161
institution DOAJ
issn 2045-2322
language English
publishDate 2025-02-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-e7f618132b0e458f879e4954499fd1612025-08-20T03:13:12ZengNature PortfolioScientific Reports2045-23222025-02-0115111410.1038/s41598-025-89944-8Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancerHao Wei0Qingsong Luo1Weimin Zhong2Clinical Laboratory, Guangyuan Central HospitalClinical Laboratory, Guangyuan Central HospitalCentral Laboratory, The Fifth Hospital of XiamenAbstract Colorectal cancer (CRC) is the most common malignancy in the digestive system, with a lower 5-year overall survival rate. There is increasing evidence showing that RNA modification regulators such as m1A, m5C, m6A, and m7G play crucial roles in tumor progression. However, the prognostic role of integrated m6A/m5C/m1A/m7G methylation modifications in CRC has not been reported and requires further investigation. Five cohorts with 989 samples were first retrieved from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Then, Three m6A/m1A/m5C/m7G-associated molecular subtypes were identified in the TCGA cohort via the consensus clustering analysis, and 1710 co-expression module genes associated with subtypes were obtained from weighted gene co-expression network analysis (WGCNA) results. After conducting univariate Cox analysis in each cohort and retaining common genes, an RNA methylation-related signature (RMS) was developed through the combination of 101 algorithms. The RMS exhibited strong accuracy and robustness in predicting survival outcomes across distinct cohorts (TCGA, GSE17536, GSE17537, GSE29612, and GSE38832) and demonstrated good performance compared with previously reported risk signatures. Additionally, the RMS was identified as an independent prognostic factor for overall survival in the TCGA, GSE17536, GSE17537, GSE29612, and GSE38832 cohorts. The patients were then stratified into high and low-risk groups based on the median risk score across the five cohorts. Compared to the high-risk groups, the low-risk group showed an increased immune cell infiltration level and showed more benefit from immunotherapy and chemotherapy drugs. Moreover, six drugs (KU-0063794, temozolomide, DNMDP, ML162, SJ-172550, ML050) from the Cancer Therapeutics Response Portal (CTRP) and five drugs (BIBX-1382, lomitapide, ZLN005, PPT, panobinostat) from the PRSM database were identified for the high-risk group patients. By integrating data from the TCGA database and the Cancer Cell Line Encyclopedia (CCLE) database, a potential therapeutic target named TERT was identified for the high-risk group of patients. The single-cell results indicated that TERT was highly expressed in epithelial cells. Overall, our developed RMS can accurately predict patients survival outcomes and immunotherapy response, indicating promising application in clinical practice. These findings may offer guidance for the prognosis and personalized treatment of CRC.https://doi.org/10.1038/s41598-025-89944-8Colorectal cancerRNA modificationMachine learningImmune cellImmunotherapyCandidate drugs
spellingShingle Hao Wei
Qingsong Luo
Weimin Zhong
Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancer
Scientific Reports
Colorectal cancer
RNA modification
Machine learning
Immune cell
Immunotherapy
Candidate drugs
title Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancer
title_full Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancer
title_fullStr Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancer
title_full_unstemmed Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancer
title_short Integration of 101 machine learning algorithm combinations to unveil m6A/m1A/m5C/m7G-associated prognostic signature in colorectal cancer
title_sort integration of 101 machine learning algorithm combinations to unveil m6a m1a m5c m7g associated prognostic signature in colorectal cancer
topic Colorectal cancer
RNA modification
Machine learning
Immune cell
Immunotherapy
Candidate drugs
url https://doi.org/10.1038/s41598-025-89944-8
work_keys_str_mv AT haowei integrationof101machinelearningalgorithmcombinationstounveilm6am1am5cm7gassociatedprognosticsignatureincolorectalcancer
AT qingsongluo integrationof101machinelearningalgorithmcombinationstounveilm6am1am5cm7gassociatedprognosticsignatureincolorectalcancer
AT weiminzhong integrationof101machinelearningalgorithmcombinationstounveilm6am1am5cm7gassociatedprognosticsignatureincolorectalcancer