IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers

Understanding the genetic components of Alzheimer’s disease (AD) via transcriptome analysis often necessitates the use of invasive methods. This work focuses on overcoming the difficulties associated with the invasive process of collecting brain tissue samples in order to measure and investigate the...

Full description

Saved in:
Bibliographic Details
Main Authors: Enes Ozelbas, Tuba Sevimoglu, Tamer Kahveci
Format: Article
Language:English
Published: IOP Publishing 2024-01-01
Series:Machine Learning: Science and Technology
Subjects:
Online Access:https://doi.org/10.1088/2632-2153/ad829d
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850205788215181312
author Enes Ozelbas
Tuba Sevimoglu
Tamer Kahveci
author_facet Enes Ozelbas
Tuba Sevimoglu
Tamer Kahveci
author_sort Enes Ozelbas
collection DOAJ
description Understanding the genetic components of Alzheimer’s disease (AD) via transcriptome analysis often necessitates the use of invasive methods. This work focuses on overcoming the difficulties associated with the invasive process of collecting brain tissue samples in order to measure and investigate the transcriptome behavior of AD. Our approach called IDEEA ( I nformation D iffusion model for integrating gene E xpression and E EG data in identifying A lzheimer’s disease markers) involves systematically linking two different but complementary modalities: transcriptomics and electroencephalogram (EEG) data. We preprocess these two data types by calculating the spectral and transcriptional sample distances, over 11 brain regions encompassing 6 distinct frequency bands. Subsequently, we employ a genetic algorithm approach to integrate the distinct features of the preprocessed data. Our experimental results show that IDEEA converges rapidly to local optima gene subsets, in fewer than 250 iterations. Our algorithm identifies novel genes along with genes that have previously been linked to AD. It is also capable of detecting genes with transcription patterns specific to individual EEG bands as well as those with common patterns among bands. In particular, the alpha2 (10–13 Hz) frequency band yielded 8 AD-associated genes out of the top 100 most frequently selected genes by our algorithm, with a p -value of 0.05. Our method not only identifies AD-related genes but also genes that interact with AD genes in terms of transcription regulation. We evaluated various aspects of our approach, including the genetic algorithm performance, band-pair association and gene interaction topology. Our approach reveals AD-relevant genes with transcription patterns inferred from EEG alone, across various frequency bands, avoiding the risky brain tissue collection process. This is a significant advancement toward the early identification of AD using non-invasive EEG recordings.
format Article
id doaj-art-445d4afbf9ae4cda88b87426218fce02
institution OA Journals
issn 2632-2153
language English
publishDate 2024-01-01
publisher IOP Publishing
record_format Article
series Machine Learning: Science and Technology
spelling doaj-art-445d4afbf9ae4cda88b87426218fce022025-08-20T02:11:00ZengIOP PublishingMachine Learning: Science and Technology2632-21532024-01-015404501610.1088/2632-2153/ad829dIDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markersEnes Ozelbas0https://orcid.org/0000-0003-4665-1952Tuba Sevimoglu1https://orcid.org/0000-0003-4563-3154Tamer Kahveci2https://orcid.org/0000-0002-4403-8612Computer Engineering, Yildiz Technical University , Istanbul, TürkiyeBioengineering, University of Health Sciences , Istanbul, TürkiyeComputer and Information Sciences and Engineering, University of Florida , Gainesville, FL, United States of AmericaUnderstanding the genetic components of Alzheimer’s disease (AD) via transcriptome analysis often necessitates the use of invasive methods. This work focuses on overcoming the difficulties associated with the invasive process of collecting brain tissue samples in order to measure and investigate the transcriptome behavior of AD. Our approach called IDEEA ( I nformation D iffusion model for integrating gene E xpression and E EG data in identifying A lzheimer’s disease markers) involves systematically linking two different but complementary modalities: transcriptomics and electroencephalogram (EEG) data. We preprocess these two data types by calculating the spectral and transcriptional sample distances, over 11 brain regions encompassing 6 distinct frequency bands. Subsequently, we employ a genetic algorithm approach to integrate the distinct features of the preprocessed data. Our experimental results show that IDEEA converges rapidly to local optima gene subsets, in fewer than 250 iterations. Our algorithm identifies novel genes along with genes that have previously been linked to AD. It is also capable of detecting genes with transcription patterns specific to individual EEG bands as well as those with common patterns among bands. In particular, the alpha2 (10–13 Hz) frequency band yielded 8 AD-associated genes out of the top 100 most frequently selected genes by our algorithm, with a p -value of 0.05. Our method not only identifies AD-related genes but also genes that interact with AD genes in terms of transcription regulation. We evaluated various aspects of our approach, including the genetic algorithm performance, band-pair association and gene interaction topology. Our approach reveals AD-relevant genes with transcription patterns inferred from EEG alone, across various frequency bands, avoiding the risky brain tissue collection process. This is a significant advancement toward the early identification of AD using non-invasive EEG recordings.https://doi.org/10.1088/2632-2153/ad829dAlzheimer’s diseaseelectroencephalogramgene expressionintegrative machine learning
spellingShingle Enes Ozelbas
Tuba Sevimoglu
Tamer Kahveci
IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers
Machine Learning: Science and Technology
Alzheimer’s disease
electroencephalogram
gene expression
integrative machine learning
title IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers
title_full IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers
title_fullStr IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers
title_full_unstemmed IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers
title_short IDEEA: information diffusion model for integrating gene expression and EEG data in identifying Alzheimer’s disease markers
title_sort ideea information diffusion model for integrating gene expression and eeg data in identifying alzheimer s disease markers
topic Alzheimer’s disease
electroencephalogram
gene expression
integrative machine learning
url https://doi.org/10.1088/2632-2153/ad829d
work_keys_str_mv AT enesozelbas ideeainformationdiffusionmodelforintegratinggeneexpressionandeegdatainidentifyingalzheimersdiseasemarkers
AT tubasevimoglu ideeainformationdiffusionmodelforintegratinggeneexpressionandeegdatainidentifyingalzheimersdiseasemarkers
AT tamerkahveci ideeainformationdiffusionmodelforintegratinggeneexpressionandeegdatainidentifyingalzheimersdiseasemarkers