IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers
Understanding the genetic components of Alzheimer’s disease (AD) via transcriptome analysis often necessitates the use of invasive methods. This work focuses on overcoming the difficulties associated with the invasive process of collecting brain tissue samples in order to measure and investigate the...
Saved in:
| Main Authors: | , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
IOP Publishing
2024-01-01
|
| Series: | Machine Learning: Science and Technology |
| Subjects: | |
| Online Access: | https://doi.org/10.1088/2632-2153/ad829d |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850205788215181312 |
|---|---|
| author | Enes Ozelbas Tuba Sevimoglu Tamer Kahveci |
| author_facet | Enes Ozelbas Tuba Sevimoglu Tamer Kahveci |
| author_sort | Enes Ozelbas |
| collection | DOAJ |
| description | Understanding the genetic components of Alzheimer’s disease (AD) via transcriptome analysis often necessitates the use of invasive methods. This work focuses on overcoming the difficulties associated with the invasive process of collecting brain tissue samples in order to measure and investigate the transcriptome behavior of AD. Our approach called IDEEA ( I nformation D iffusion model for integrating gene E xpression and E EG data in identifying A lzheimer’s disease markers) involves systematically linking two different but complementary modalities: transcriptomics and electroencephalogram (EEG) data. We preprocess these two data types by calculating the spectral and transcriptional sample distances, over 11 brain regions encompassing 6 distinct frequency bands. Subsequently, we employ a genetic algorithm approach to integrate the distinct features of the preprocessed data. Our experimental results show that IDEEA converges rapidly to local optima gene subsets, in fewer than 250 iterations. Our algorithm identifies novel genes along with genes that have previously been linked to AD. It is also capable of detecting genes with transcription patterns specific to individual EEG bands as well as those with common patterns among bands. In particular, the alpha2 (10–13 Hz) frequency band yielded 8 AD-associated genes out of the top 100 most frequently selected genes by our algorithm, with a p -value of 0.05. Our method not only identifies AD-related genes but also genes that interact with AD genes in terms of transcription regulation. We evaluated various aspects of our approach, including the genetic algorithm performance, band-pair association and gene interaction topology. Our approach reveals AD-relevant genes with transcription patterns inferred from EEG alone, across various frequency bands, avoiding the risky brain tissue collection process. This is a significant advancement toward the early identification of AD using non-invasive EEG recordings. |
| format | Article |
| id | doaj-art-445d4afbf9ae4cda88b87426218fce02 |
| institution | OA Journals |
| issn | 2632-2153 |
| language | English |
| publishDate | 2024-01-01 |
| publisher | IOP Publishing |
| record_format | Article |
| series | Machine Learning: Science and Technology |
| spelling | doaj-art-445d4afbf9ae4cda88b87426218fce022025-08-20T02:11:00ZengIOP PublishingMachine Learning: Science and Technology2632-21532024-01-015404501610.1088/2632-2153/ad829dIDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markersEnes Ozelbas0https://orcid.org/0000-0003-4665-1952Tuba Sevimoglu1https://orcid.org/0000-0003-4563-3154Tamer Kahveci2https://orcid.org/0000-0002-4403-8612Computer Engineering, Yildiz Technical University , Istanbul, TürkiyeBioengineering, University of Health Sciences , Istanbul, TürkiyeComputer and Information Sciences and Engineering, University of Florida , Gainesville, FL, United States of AmericaUnderstanding the genetic components of Alzheimer’s disease (AD) via transcriptome analysis often necessitates the use of invasive methods. This work focuses on overcoming the difficulties associated with the invasive process of collecting brain tissue samples in order to measure and investigate the transcriptome behavior of AD. Our approach called IDEEA ( I nformation D iffusion model for integrating gene E xpression and E EG data in identifying A lzheimer’s disease markers) involves systematically linking two different but complementary modalities: transcriptomics and electroencephalogram (EEG) data. We preprocess these two data types by calculating the spectral and transcriptional sample distances, over 11 brain regions encompassing 6 distinct frequency bands. Subsequently, we employ a genetic algorithm approach to integrate the distinct features of the preprocessed data. Our experimental results show that IDEEA converges rapidly to local optima gene subsets, in fewer than 250 iterations. Our algorithm identifies novel genes along with genes that have previously been linked to AD. It is also capable of detecting genes with transcription patterns specific to individual EEG bands as well as those with common patterns among bands. In particular, the alpha2 (10–13 Hz) frequency band yielded 8 AD-associated genes out of the top 100 most frequently selected genes by our algorithm, with a p -value of 0.05. Our method not only identifies AD-related genes but also genes that interact with AD genes in terms of transcription regulation. We evaluated various aspects of our approach, including the genetic algorithm performance, band-pair association and gene interaction topology. Our approach reveals AD-relevant genes with transcription patterns inferred from EEG alone, across various frequency bands, avoiding the risky brain tissue collection process. This is a significant advancement toward the early identification of AD using non-invasive EEG recordings.https://doi.org/10.1088/2632-2153/ad829dAlzheimer’s diseaseelectroencephalogramgene expressionintegrative machine learning |
| spellingShingle | Enes Ozelbas Tuba Sevimoglu Tamer Kahveci IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers Machine Learning: Science and Technology Alzheimer’s disease electroencephalogram gene expression integrative machine learning |
| title | IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers |
| title_full | IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers |
| title_fullStr | IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers |
| title_full_unstemmed | IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers |
| title_short | IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers |
| title_sort | ideea information diffusion model for integrating gene expression and eeg data in identifying alzheimer s disease markers |
| topic | Alzheimer’s disease electroencephalogram gene expression integrative machine learning |
| url | https://doi.org/10.1088/2632-2153/ad829d |
| work_keys_str_mv | AT enesozelbas ideeainformationdiffusionmodelforintegratinggeneexpressionandeegdatainidentifyingalzheimersdiseasemarkers AT tubasevimoglu ideeainformationdiffusionmodelforintegratinggeneexpressionandeegdatainidentifyingalzheimersdiseasemarkers AT tamerkahveci ideeainformationdiffusionmodelforintegratinggeneexpressionandeegdatainidentifyingalzheimersdiseasemarkers |