Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.

Gestational alcohol exposure causes fetal alcohol spectrum disorder (FASD) and is a prominent cause of neurodevelopmental disability. Whole transcriptome sequencing (RNA-Seq) offer insights into mechanisms underlying FASD, but gene-level analysis provides limited information regarding complex transc...

Full description

Saved in:
Bibliographic Details
Main Authors: Abrar E Al-Shaer, George R Flentke, Mark E Berres, Ana Garic, Susan M Smith
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2019-04-01
Series:PLoS Computational Biology
Online Access:https://doi.org/10.1371/journal.pcbi.1006937
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849419310037991424
author Abrar E Al-Shaer
George R Flentke
Mark E Berres
Ana Garic
Susan M Smith
author_facet Abrar E Al-Shaer
George R Flentke
Mark E Berres
Ana Garic
Susan M Smith
author_sort Abrar E Al-Shaer
collection DOAJ
description Gestational alcohol exposure causes fetal alcohol spectrum disorder (FASD) and is a prominent cause of neurodevelopmental disability. Whole transcriptome sequencing (RNA-Seq) offer insights into mechanisms underlying FASD, but gene-level analysis provides limited information regarding complex transcriptional processes such as alternative splicing and non-coding RNAs. Moreover, traditional analytical approaches that use multiple hypothesis testing with a false discovery rate adjustment prioritize genes based on an adjusted p-value, which is not always biologically relevant. We address these limitations with a novel approach and implemented an unsupervised machine learning model, which we applied to an exon-level analysis to reduce data complexity to the most likely functionally relevant exons, without loss of novel information. This was performed on an RNA-Seq paired-end dataset derived from alcohol-exposed neural fold-stage chick crania, wherein alcohol causes facial deficits recapitulating those of FASD. A principal component analysis along with k-means clustering was utilized to extract exons that deviated from baseline expression. This identified 6857 differentially expressed exons representing 1251 geneIDs; 391 of these genes were identified in a prior gene-level analysis of this dataset. It also identified exons encoding 23 microRNAs (miRNAs) having significantly differential expression profiles in response to alcohol. We developed an RDAVID pipeline to identify KEGG pathways represented by these exons, and separately identified predicted KEGG pathways targeted by these miRNAs. Several of these (ribosome biogenesis, oxidative phosphorylation) were identified in our prior gene-level analysis. Other pathways are crucial to facial morphogenesis and represent both novel (focal adhesion, FoxO signaling, insulin signaling) and known (Wnt signaling) alcohol targets. Importantly, there was substantial overlap between the exomes themselves and the predicted miRNA targets, suggesting these miRNAs contribute to the gene-level expression changes. Our novel application of unsupervised machine learning in conjunction with statistical analyses facilitated the discovery of signaling pathways and miRNAs that inform mechanisms underlying FASD.
format Article
id doaj-art-c3f9dc3218e74acb8aade7bec71d45a7
institution Kabale University
issn 1553-734X
1553-7358
language English
publishDate 2019-04-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj-art-c3f9dc3218e74acb8aade7bec71d45a72025-08-20T03:32:09ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582019-04-01154e100693710.1371/journal.pcbi.1006937Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.Abrar E Al-ShaerGeorge R FlentkeMark E BerresAna GaricSusan M SmithGestational alcohol exposure causes fetal alcohol spectrum disorder (FASD) and is a prominent cause of neurodevelopmental disability. Whole transcriptome sequencing (RNA-Seq) offer insights into mechanisms underlying FASD, but gene-level analysis provides limited information regarding complex transcriptional processes such as alternative splicing and non-coding RNAs. Moreover, traditional analytical approaches that use multiple hypothesis testing with a false discovery rate adjustment prioritize genes based on an adjusted p-value, which is not always biologically relevant. We address these limitations with a novel approach and implemented an unsupervised machine learning model, which we applied to an exon-level analysis to reduce data complexity to the most likely functionally relevant exons, without loss of novel information. This was performed on an RNA-Seq paired-end dataset derived from alcohol-exposed neural fold-stage chick crania, wherein alcohol causes facial deficits recapitulating those of FASD. A principal component analysis along with k-means clustering was utilized to extract exons that deviated from baseline expression. This identified 6857 differentially expressed exons representing 1251 geneIDs; 391 of these genes were identified in a prior gene-level analysis of this dataset. It also identified exons encoding 23 microRNAs (miRNAs) having significantly differential expression profiles in response to alcohol. We developed an RDAVID pipeline to identify KEGG pathways represented by these exons, and separately identified predicted KEGG pathways targeted by these miRNAs. Several of these (ribosome biogenesis, oxidative phosphorylation) were identified in our prior gene-level analysis. Other pathways are crucial to facial morphogenesis and represent both novel (focal adhesion, FoxO signaling, insulin signaling) and known (Wnt signaling) alcohol targets. Importantly, there was substantial overlap between the exomes themselves and the predicted miRNA targets, suggesting these miRNAs contribute to the gene-level expression changes. Our novel application of unsupervised machine learning in conjunction with statistical analyses facilitated the discovery of signaling pathways and miRNAs that inform mechanisms underlying FASD.https://doi.org/10.1371/journal.pcbi.1006937
spellingShingle Abrar E Al-Shaer
George R Flentke
Mark E Berres
Ana Garic
Susan M Smith
Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.
PLoS Computational Biology
title Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.
title_full Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.
title_fullStr Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.
title_full_unstemmed Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.
title_short Exon level machine learning analyses elucidate novel candidate miRNA targets in an avian model of fetal alcohol spectrum disorder.
title_sort exon level machine learning analyses elucidate novel candidate mirna targets in an avian model of fetal alcohol spectrum disorder
url https://doi.org/10.1371/journal.pcbi.1006937
work_keys_str_mv AT abrarealshaer exonlevelmachinelearninganalyseselucidatenovelcandidatemirnatargetsinanavianmodeloffetalalcoholspectrumdisorder
AT georgerflentke exonlevelmachinelearninganalyseselucidatenovelcandidatemirnatargetsinanavianmodeloffetalalcoholspectrumdisorder
AT markeberres exonlevelmachinelearninganalyseselucidatenovelcandidatemirnatargetsinanavianmodeloffetalalcoholspectrumdisorder
AT anagaric exonlevelmachinelearninganalyseselucidatenovelcandidatemirnatargetsinanavianmodeloffetalalcoholspectrumdisorder
AT susanmsmith exonlevelmachinelearninganalyseselucidatenovelcandidatemirnatargetsinanavianmodeloffetalalcoholspectrumdisorder