Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning

Association between proteins and diseases have been widely studied to understand disease triggers and identify potential therapeutic targets. Deciphering the complexity of gene networks is crucial for understanding diseases. Node embedding offers a powerful approach, revealing latent patterns that c...

Full description

Saved in:
Bibliographic Details
Main Authors: Hansa J. Thattil, M. N. Arunkumar, Francis Antony
Format: Article
Language:English
Published: IEEE 2025-01-01
Series:IEEE Access
Subjects:
Online Access:https://ieeexplore.ieee.org/document/10963671/
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849714558750425088
author Hansa J. Thattil
M. N. Arunkumar
Francis Antony
author_facet Hansa J. Thattil
M. N. Arunkumar
Francis Antony
author_sort Hansa J. Thattil
collection DOAJ
description Association between proteins and diseases have been widely studied to understand disease triggers and identify potential therapeutic targets. Deciphering the complexity of gene networks is crucial for understanding diseases. Node embedding offers a powerful approach, revealing latent patterns that can predict novel gene-disease associations. In this study, we applied a meta-learning approach to predict protein-disease associations with a focus on Alzheimer’s disease. Our approach utilized curated data from the Disease Gene Network Database (DisGeNET), a database of gene-disease associations, and protein-protein interaction data from the Search Tool for Retrieval of Interacting Genes/Proteins (STRING) dataset. We generated embeddings for Protein-Protein Interactions(PPI) using the node2vec algorithm, capturing interaction patterns in a low-dimensional space. HybridBoost, a two-layered model architecture, was then employed, with random forest and CatBoost classifiers in the first layer and CatBoost as the meta-learner in the second layer. Our model achieved an accuracy of 96% and with precision, recall, and F1 scores also outperforming individual models like random forest, support vector machine, AdaBoost, and CatBoost. These results demonstrate the effectiveness of the proposed meta-learning model in predicting protein-disease associations for Alzheimer’s disease.
format Article
id doaj-art-e8847dbdd48442d8920ce511c8096fdf
institution DOAJ
issn 2169-3536
language English
publishDate 2025-01-01
publisher IEEE
record_format Article
series IEEE Access
spelling doaj-art-e8847dbdd48442d8920ce511c8096fdf2025-08-20T03:13:40ZengIEEEIEEE Access2169-35362025-01-0113684226843810.1109/ACCESS.2025.356021610963671Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-LearningHansa J. Thattil0https://orcid.org/0009-0005-1317-1567M. N. Arunkumar1https://orcid.org/0000-0003-4487-8053Francis Antony2https://orcid.org/0009-0008-7709-067XResearch Scholar, Computer Science and Engineering, APJ Abdul Kalam Technological University, Thiruvananthapuram, Kerala, IndiaDepartment of Computer Science and Engineering, Federal Institute of Science and Technology, Ernakulam, Angamaly, Kerala, IndiaKPIT Technologies, Kakkanad, Kerala, IndiaAssociation between proteins and diseases have been widely studied to understand disease triggers and identify potential therapeutic targets. Deciphering the complexity of gene networks is crucial for understanding diseases. Node embedding offers a powerful approach, revealing latent patterns that can predict novel gene-disease associations. In this study, we applied a meta-learning approach to predict protein-disease associations with a focus on Alzheimer’s disease. Our approach utilized curated data from the Disease Gene Network Database (DisGeNET), a database of gene-disease associations, and protein-protein interaction data from the Search Tool for Retrieval of Interacting Genes/Proteins (STRING) dataset. We generated embeddings for Protein-Protein Interactions(PPI) using the node2vec algorithm, capturing interaction patterns in a low-dimensional space. HybridBoost, a two-layered model architecture, was then employed, with random forest and CatBoost classifiers in the first layer and CatBoost as the meta-learner in the second layer. Our model achieved an accuracy of 96% and with precision, recall, and F1 scores also outperforming individual models like random forest, support vector machine, AdaBoost, and CatBoost. These results demonstrate the effectiveness of the proposed meta-learning model in predicting protein-disease associations for Alzheimer’s disease.https://ieeexplore.ieee.org/document/10963671/Alzheimer’s diseaseADASYNmeta-learningNode2vecprotein-protein interactionsprotein-disease associations
spellingShingle Hansa J. Thattil
M. N. Arunkumar
Francis Antony
Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning
IEEE Access
Alzheimer’s disease
ADASYN
meta-learning
Node2vec
protein-protein interactions
protein-disease associations
title Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning
title_full Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning
title_fullStr Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning
title_full_unstemmed Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning
title_short Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning
title_sort enhanced alzheimer x2019 s disease prediction through integration of protein protein interaction data and meta learning
topic Alzheimer’s disease
ADASYN
meta-learning
Node2vec
protein-protein interactions
protein-disease associations
url https://ieeexplore.ieee.org/document/10963671/
work_keys_str_mv AT hansajthattil enhancedalzheimerx2019sdiseasepredictionthroughintegrationofproteinproteininteractiondataandmetalearning
AT mnarunkumar enhancedalzheimerx2019sdiseasepredictionthroughintegrationofproteinproteininteractiondataandmetalearning
AT francisantony enhancedalzheimerx2019sdiseasepredictionthroughintegrationofproteinproteininteractiondataandmetalearning