Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learning

Abstract Colorectal polyps are precursors of colorectal cancer. Metabolic dysfunction associated steatohepatitis (MASH) is one of metabolic dysfunction associated fatty liver disease (MAFLD) phenotypic manifestations. Much evidence has suggested an association between MASH and polyps. This study inv...

Full description

Saved in:
Bibliographic Details
Main Authors: Ying Geng, Yifang Li, Ge Liu, Jian Jiao
Format: Article
Language:English
Published: Nature Portfolio 2024-11-01
Series:Scientific Reports
Subjects:
Online Access:https://doi.org/10.1038/s41598-024-81120-8
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850216366439661568
author Ying Geng
Yifang Li
Ge Liu
Jian Jiao
author_facet Ying Geng
Yifang Li
Ge Liu
Jian Jiao
author_sort Ying Geng
collection DOAJ
description Abstract Colorectal polyps are precursors of colorectal cancer. Metabolic dysfunction associated steatohepatitis (MASH) is one of metabolic dysfunction associated fatty liver disease (MAFLD) phenotypic manifestations. Much evidence has suggested an association between MASH and polyps. This study investigated the biomarkers of MASH and colorectal polyps, and the prediction of targeted drugs using an integrated bioinformatics analysis method. Differentially expressed genes (DEGs) analysis and weighted gene co-expression network analysis (WGCNA) were performed on GSE89632 and GSE41258 datasets, 49 shared genes revealed after intersection. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses depicted they were mainly enriched in apoptosis, proliferation and infection pathways. Machine learning algorithms identified S100P, FOXO1, and LPAR1 were biomarkers for colorectal polyps and MASH, ROC curve and violin plot showed ideal AUC and stable expression patterns in both the discovery and validation sets. GSEA analysis showed significant enrichment of bile acid and fatty acid pathways when grouped by the expression levels of the three candidate biomarkers. Immune infiltration analysis showed a significant infiltration of M0 macrophages and Treg cells in the colorectal polyps group. A total of 9 small molecule compounds were considered as potential chemoprevention agents in MASH and colorectal polyps by using the CMap website. Using integrated bioinformatics analysis, the molecular mechanism between MASH and colorectal polyps has been further explored.
format Article
id doaj-art-c174f32656ac4bc5b7c1beb973db74b0
institution OA Journals
issn 2045-2322
language English
publishDate 2024-11-01
publisher Nature Portfolio
record_format Article
series Scientific Reports
spelling doaj-art-c174f32656ac4bc5b7c1beb973db74b02025-08-20T02:08:19ZengNature PortfolioScientific Reports2045-23222024-11-0114111710.1038/s41598-024-81120-8Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learningYing Geng0Yifang Li1Ge Liu2Jian Jiao3Department of Gastroenterology and Hepatology, China-Japan Union Hospital, Jilin UniversityDepartment of Gastroenterology and Hepatology, China-Japan Union Hospital, Jilin UniversityDepartment of Gastroenterology and Hepatology, China-Japan Union Hospital, Jilin UniversityDepartment of Gastroenterology and Hepatology, China-Japan Union Hospital, Jilin UniversityAbstract Colorectal polyps are precursors of colorectal cancer. Metabolic dysfunction associated steatohepatitis (MASH) is one of metabolic dysfunction associated fatty liver disease (MAFLD) phenotypic manifestations. Much evidence has suggested an association between MASH and polyps. This study investigated the biomarkers of MASH and colorectal polyps, and the prediction of targeted drugs using an integrated bioinformatics analysis method. Differentially expressed genes (DEGs) analysis and weighted gene co-expression network analysis (WGCNA) were performed on GSE89632 and GSE41258 datasets, 49 shared genes revealed after intersection. The Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment analyses depicted they were mainly enriched in apoptosis, proliferation and infection pathways. Machine learning algorithms identified S100P, FOXO1, and LPAR1 were biomarkers for colorectal polyps and MASH, ROC curve and violin plot showed ideal AUC and stable expression patterns in both the discovery and validation sets. GSEA analysis showed significant enrichment of bile acid and fatty acid pathways when grouped by the expression levels of the three candidate biomarkers. Immune infiltration analysis showed a significant infiltration of M0 macrophages and Treg cells in the colorectal polyps group. A total of 9 small molecule compounds were considered as potential chemoprevention agents in MASH and colorectal polyps by using the CMap website. Using integrated bioinformatics analysis, the molecular mechanism between MASH and colorectal polyps has been further explored.https://doi.org/10.1038/s41598-024-81120-8Colorectal polypsMetabolic dysfunction associated steatohepatitis (MASH)Bioinformatics analysisMachine learningBile acids
spellingShingle Ying Geng
Yifang Li
Ge Liu
Jian Jiao
Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learning
Scientific Reports
Colorectal polyps
Metabolic dysfunction associated steatohepatitis (MASH)
Bioinformatics analysis
Machine learning
Bile acids
title Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learning
title_full Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learning
title_fullStr Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learning
title_full_unstemmed Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learning
title_short Identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction-associated steatohepatitis (MASH) by bioinformatics analysis and machine learning
title_sort identification of biomarkers for the diagnosis in colorectal polyps and metabolic dysfunction associated steatohepatitis mash by bioinformatics analysis and machine learning
topic Colorectal polyps
Metabolic dysfunction associated steatohepatitis (MASH)
Bioinformatics analysis
Machine learning
Bile acids
url https://doi.org/10.1038/s41598-024-81120-8
work_keys_str_mv AT yinggeng identificationofbiomarkersforthediagnosisincolorectalpolypsandmetabolicdysfunctionassociatedsteatohepatitismashbybioinformaticsanalysisandmachinelearning
AT yifangli identificationofbiomarkersforthediagnosisincolorectalpolypsandmetabolicdysfunctionassociatedsteatohepatitismashbybioinformaticsanalysisandmachinelearning
AT geliu identificationofbiomarkersforthediagnosisincolorectalpolypsandmetabolicdysfunctionassociatedsteatohepatitismashbybioinformaticsanalysisandmachinelearning
AT jianjiao identificationofbiomarkersforthediagnosisincolorectalpolypsandmetabolicdysfunctionassociatedsteatohepatitismashbybioinformaticsanalysisandmachinelearning