Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.

<h4>Background</h4>Diabetic kidney disease (DKD) is a severe global complication of diabetes, yet its molecular mechanisms remain incompletely understood. This study aimed to investigate the role of protein glycosylation in DKD pathogenesis and its association with gene expression change...

Full description

Saved in:
Bibliographic Details
Main Authors: Ziyang Liu, Zengyuan Qin, Wenxin Bai, Shasha Wang, Chunling Huang, Na Li, Lei Yan, Yue Gu, Fengmin Shao
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2025-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0329640
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849228125305569280
author Ziyang Liu
Zengyuan Qin
Wenxin Bai
Shasha Wang
Chunling Huang
Na Li
Lei Yan
Yue Gu
Fengmin Shao
author_facet Ziyang Liu
Zengyuan Qin
Wenxin Bai
Shasha Wang
Chunling Huang
Na Li
Lei Yan
Yue Gu
Fengmin Shao
author_sort Ziyang Liu
collection DOAJ
description <h4>Background</h4>Diabetic kidney disease (DKD) is a severe global complication of diabetes, yet its molecular mechanisms remain incompletely understood. This study aimed to investigate the role of protein glycosylation in DKD pathogenesis and its association with gene expression changes, with the goal of identifying diagnostic biomarkers and personalized therapeutic targets.<h4>Methods</h4>Integrated bioinformatics and machine learning approaches were applied to analyze multiple gene expression datasets. Differentially expressed glycosylation-related genes were identified, followed by unsupervised clustering to define molecular subtypes. Functional enrichment, immune cell infiltration analysis, and machine learning algorithms (including feature selection for hub genes) were employed. qPCR validation was performed on clinical DKD and normal kidney tissues, and ROC curves were generated to assess diagnostic potential.<h4>Results</h4>Unsupervised clustering of glycosylation-related genes revealed two distinct DKD molecular subtypes with differential pathway activation (e.g., extracellular matrix remodeling) and immune infiltration patterns. Six hub genes (S100A12, EXT1, SBSPON, ADAMTS1, FMOD, SPTB) were identified as critical to DKD pathogenesis through machine learning. Immune infiltration analysis showed significant differences in macrophage and neutrophil activity between DKD and controls and Immunohistochemical results confirmed the occurrence of immune infiltration. qPCR validation confirmed dysregulation of hub genes in DKD tissues compared to normal samples. ROC analysis demonstrated high diagnostic accuracy for these genes.<h4>Conclusions</h4>This study highlights abnormal protein glycosylation as a key player in DKD and identifies six hub genes with potential as diagnostic biomarkers. The molecular subtypes and immune infiltration patterns provide insights into disease heterogeneity, paving the way for personalized therapies. Future studies should validate these findings in larger cohorts with explicit sample sizes to strengthen clinical applicability.
format Article
id doaj-art-f78bc3114d3e4e09831cb4faf96c384e
institution Kabale University
issn 1932-6203
language English
publishDate 2025-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-f78bc3114d3e4e09831cb4faf96c384e2025-08-23T05:32:30ZengPublic Library of Science (PLoS)PLoS ONE1932-62032025-01-01208e032964010.1371/journal.pone.0329640Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.Ziyang LiuZengyuan QinWenxin BaiShasha WangChunling HuangNa LiLei YanYue GuFengmin Shao<h4>Background</h4>Diabetic kidney disease (DKD) is a severe global complication of diabetes, yet its molecular mechanisms remain incompletely understood. This study aimed to investigate the role of protein glycosylation in DKD pathogenesis and its association with gene expression changes, with the goal of identifying diagnostic biomarkers and personalized therapeutic targets.<h4>Methods</h4>Integrated bioinformatics and machine learning approaches were applied to analyze multiple gene expression datasets. Differentially expressed glycosylation-related genes were identified, followed by unsupervised clustering to define molecular subtypes. Functional enrichment, immune cell infiltration analysis, and machine learning algorithms (including feature selection for hub genes) were employed. qPCR validation was performed on clinical DKD and normal kidney tissues, and ROC curves were generated to assess diagnostic potential.<h4>Results</h4>Unsupervised clustering of glycosylation-related genes revealed two distinct DKD molecular subtypes with differential pathway activation (e.g., extracellular matrix remodeling) and immune infiltration patterns. Six hub genes (S100A12, EXT1, SBSPON, ADAMTS1, FMOD, SPTB) were identified as critical to DKD pathogenesis through machine learning. Immune infiltration analysis showed significant differences in macrophage and neutrophil activity between DKD and controls and Immunohistochemical results confirmed the occurrence of immune infiltration. qPCR validation confirmed dysregulation of hub genes in DKD tissues compared to normal samples. ROC analysis demonstrated high diagnostic accuracy for these genes.<h4>Conclusions</h4>This study highlights abnormal protein glycosylation as a key player in DKD and identifies six hub genes with potential as diagnostic biomarkers. The molecular subtypes and immune infiltration patterns provide insights into disease heterogeneity, paving the way for personalized therapies. Future studies should validate these findings in larger cohorts with explicit sample sizes to strengthen clinical applicability.https://doi.org/10.1371/journal.pone.0329640
spellingShingle Ziyang Liu
Zengyuan Qin
Wenxin Bai
Shasha Wang
Chunling Huang
Na Li
Lei Yan
Yue Gu
Fengmin Shao
Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.
PLoS ONE
title Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.
title_full Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.
title_fullStr Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.
title_full_unstemmed Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.
title_short Integrating bioinformatics and machine learning to elucidate the role of protein glycosylation-related genes in the pathogenesis of diabetic kidney disease.
title_sort integrating bioinformatics and machine learning to elucidate the role of protein glycosylation related genes in the pathogenesis of diabetic kidney disease
url https://doi.org/10.1371/journal.pone.0329640
work_keys_str_mv AT ziyangliu integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT zengyuanqin integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT wenxinbai integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT shashawang integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT chunlinghuang integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT nali integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT leiyan integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT yuegu integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease
AT fengminshao integratingbioinformaticsandmachinelearningtoelucidatetheroleofproteinglycosylationrelatedgenesinthepathogenesisofdiabetickidneydisease