Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasets

Abstract Microbiome research has revealed associations between microbial species and colorectal cancer (CRC). Most of the existing research relied on metagenomic data. We leveraged a tool that we recently developed for detecting human and microbial peptides from (meta)proteomics data to reanalyze Cl...

Full description

Saved in:
Bibliographic Details
Main Authors: Jamie Canderan, Yuzhen Ye
Format: Article
Language:English
Published: Nature Portfolio 2025-04-01
Series:Scientific Reports
Online Access:https://doi.org/10.1038/s41598-025-97984-3
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849713091859709952
author Jamie Canderan
Yuzhen Ye
author_facet Jamie Canderan
Yuzhen Ye
author_sort Jamie Canderan
collection DOAJ
description Abstract Microbiome research has revealed associations between microbial species and colorectal cancer (CRC). Most of the existing research relied on metagenomic data. We leveraged a tool that we recently developed for detecting human and microbial peptides from (meta)proteomics data to reanalyze Clinical Proteomic Tumor Analysis Consortium CRC proteomics datasets. Our analyses revealed potential microbial species and proteins that are associated with CRC, especially when analyzing multiplexed proteomics data consisting of cancerous and healthy tissue taken from the same individuals. Many of the identified proteins are associated with species with known links to CRC, such as the fungi Aspergillus kawachii, but many are unstudied or their specific roles unknown. Proteins from other microbial species, such as Paenibacillus cellulosilyticus, were also identified in the samples. We showed that Aspergillus kawachii and others are depleted overall in cancer samples, which is consistent with a previous genomic-based multi-cohort study. Our analysis also revealed that some proteins belonging to this species are more abundantly detected, while others in this and other species are not. Further, we showed that microbial identifications could be used to build predictive models for tumor detection, but caution needs to be taken when applying such models trained on one dataset to another due to the substantial impacts of different experimental techniques on peptide detection profiles.
format Article
id doaj-art-b2e06224e89840bea90143461cb9e32b
institution DOAJ
issn 2045-2322
language English
publishDate 2025-04-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-b2e06224e89840bea90143461cb9e32b2025-08-20T03:14:03ZengNature PortfolioScientific Reports2045-23222025-04-0115111110.1038/s41598-025-97984-3Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasetsJamie Canderan0Yuzhen Ye1Luddy School of Informatics, Computing and Engineering, Indiana UniversityLuddy School of Informatics, Computing and Engineering, Indiana UniversityAbstract Microbiome research has revealed associations between microbial species and colorectal cancer (CRC). Most of the existing research relied on metagenomic data. We leveraged a tool that we recently developed for detecting human and microbial peptides from (meta)proteomics data to reanalyze Clinical Proteomic Tumor Analysis Consortium CRC proteomics datasets. Our analyses revealed potential microbial species and proteins that are associated with CRC, especially when analyzing multiplexed proteomics data consisting of cancerous and healthy tissue taken from the same individuals. Many of the identified proteins are associated with species with known links to CRC, such as the fungi Aspergillus kawachii, but many are unstudied or their specific roles unknown. Proteins from other microbial species, such as Paenibacillus cellulosilyticus, were also identified in the samples. We showed that Aspergillus kawachii and others are depleted overall in cancer samples, which is consistent with a previous genomic-based multi-cohort study. Our analysis also revealed that some proteins belonging to this species are more abundantly detected, while others in this and other species are not. Further, we showed that microbial identifications could be used to build predictive models for tumor detection, but caution needs to be taken when applying such models trained on one dataset to another due to the substantial impacts of different experimental techniques on peptide detection profiles.https://doi.org/10.1038/s41598-025-97984-3
spellingShingle Jamie Canderan
Yuzhen Ye
Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasets
Scientific Reports
title Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasets
title_full Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasets
title_fullStr Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasets
title_full_unstemmed Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasets
title_short Identification of microbial species and proteins associated with colorectal cancer by reanalyzing CPTAC proteomic datasets
title_sort identification of microbial species and proteins associated with colorectal cancer by reanalyzing cptac proteomic datasets
url https://doi.org/10.1038/s41598-025-97984-3
work_keys_str_mv AT jamiecanderan identificationofmicrobialspeciesandproteinsassociatedwithcolorectalcancerbyreanalyzingcptacproteomicdatasets
AT yuzhenye identificationofmicrobialspeciesandproteinsassociatedwithcolorectalcancerbyreanalyzingcptacproteomicdatasets