Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning
Association between proteins and diseases have been widely studied to understand disease triggers and identify potential therapeutic targets. Deciphering the complexity of gene networks is crucial for understanding diseases. Node embedding offers a powerful approach, revealing latent patterns that c...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IEEE
2025-01-01
|
| Series: | IEEE Access |
| Subjects: | |
| Online Access: | https://ieeexplore.ieee.org/document/10963671/ |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1849714558750425088 |
|---|---|
| author | Hansa J. Thattil M. N. Arunkumar Francis Antony |
| author_facet | Hansa J. Thattil M. N. Arunkumar Francis Antony |
| author_sort | Hansa J. Thattil |
| collection | DOAJ |
| description | Association between proteins and diseases have been widely studied to understand disease triggers and identify potential therapeutic targets. Deciphering the complexity of gene networks is crucial for understanding diseases. Node embedding offers a powerful approach, revealing latent patterns that can predict novel gene-disease associations. In this study, we applied a meta-learning approach to predict protein-disease associations with a focus on Alzheimer’s disease. Our approach utilized curated data from the Disease Gene Network Database (DisGeNET), a database of gene-disease associations, and protein-protein interaction data from the Search Tool for Retrieval of Interacting Genes/Proteins (STRING) dataset. We generated embeddings for Protein-Protein Interactions(PPI) using the node2vec algorithm, capturing interaction patterns in a low-dimensional space. HybridBoost, a two-layered model architecture, was then employed, with random forest and CatBoost classifiers in the first layer and CatBoost as the meta-learner in the second layer. Our model achieved an accuracy of 96% and with precision, recall, and F1 scores also outperforming individual models like random forest, support vector machine, AdaBoost, and CatBoost. These results demonstrate the effectiveness of the proposed meta-learning model in predicting protein-disease associations for Alzheimer’s disease. |
| format | Article |
| id | doaj-art-e8847dbdd48442d8920ce511c8096fdf |
| institution | DOAJ |
| issn | 2169-3536 |
| language | English |
| publishDate | 2025-01-01 |
| publisher | IEEE |
| record_format | Article |
| series | IEEE Access |
| spelling | doaj-art-e8847dbdd48442d8920ce511c8096fdf2025-08-20T03:13:40ZengIEEEIEEE Access2169-35362025-01-0113684226843810.1109/ACCESS.2025.356021610963671Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-LearningHansa J. Thattil0https://orcid.org/0009-0005-1317-1567M. N. Arunkumar1https://orcid.org/0000-0003-4487-8053Francis Antony2https://orcid.org/0009-0008-7709-067XResearch Scholar, Computer Science and Engineering, APJ Abdul Kalam Technological University, Thiruvananthapuram, Kerala, IndiaDepartment of Computer Science and Engineering, Federal Institute of Science and Technology, Ernakulam, Angamaly, Kerala, IndiaKPIT Technologies, Kakkanad, Kerala, IndiaAssociation between proteins and diseases have been widely studied to understand disease triggers and identify potential therapeutic targets. Deciphering the complexity of gene networks is crucial for understanding diseases. Node embedding offers a powerful approach, revealing latent patterns that can predict novel gene-disease associations. In this study, we applied a meta-learning approach to predict protein-disease associations with a focus on Alzheimer’s disease. Our approach utilized curated data from the Disease Gene Network Database (DisGeNET), a database of gene-disease associations, and protein-protein interaction data from the Search Tool for Retrieval of Interacting Genes/Proteins (STRING) dataset. We generated embeddings for Protein-Protein Interactions(PPI) using the node2vec algorithm, capturing interaction patterns in a low-dimensional space. HybridBoost, a two-layered model architecture, was then employed, with random forest and CatBoost classifiers in the first layer and CatBoost as the meta-learner in the second layer. Our model achieved an accuracy of 96% and with precision, recall, and F1 scores also outperforming individual models like random forest, support vector machine, AdaBoost, and CatBoost. These results demonstrate the effectiveness of the proposed meta-learning model in predicting protein-disease associations for Alzheimer’s disease.https://ieeexplore.ieee.org/document/10963671/Alzheimer’s diseaseADASYNmeta-learningNode2vecprotein-protein interactionsprotein-disease associations |
| spellingShingle | Hansa J. Thattil M. N. Arunkumar Francis Antony Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning IEEE Access Alzheimer’s disease ADASYN meta-learning Node2vec protein-protein interactions protein-disease associations |
| title | Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning |
| title_full | Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning |
| title_fullStr | Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning |
| title_full_unstemmed | Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning |
| title_short | Enhanced Alzheimer’s Disease Prediction Through Integration of Protein-Protein Interaction Data and Meta-Learning |
| title_sort | enhanced alzheimer x2019 s disease prediction through integration of protein protein interaction data and meta learning |
| topic | Alzheimer’s disease ADASYN meta-learning Node2vec protein-protein interactions protein-disease associations |
| url | https://ieeexplore.ieee.org/document/10963671/ |
| work_keys_str_mv | AT hansajthattil enhancedalzheimerx2019sdiseasepredictionthroughintegrationofproteinproteininteractiondataandmetalearning AT mnarunkumar enhancedalzheimerx2019sdiseasepredictionthroughintegrationofproteinproteininteractiondataandmetalearning AT francisantony enhancedalzheimerx2019sdiseasepredictionthroughintegrationofproteinproteininteractiondataandmetalearning |