A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.

The increasing quantity and quality of functional genomic information motivate the assessment and integration of these data with association data, including data originating from genome-wide association studies (GWAS). We used previously described GWAS signals ("hits") to train a regulariz...

Full description

Saved in:
Bibliographic Details
Main Authors: Sarah A Gagliano, Michael R Barnes, Michael E Weale, Jo Knight
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2014-01-01
Series:PLoS ONE
Online Access:https://doi.org/10.1371/journal.pone.0098122
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850124877973946368
author Sarah A Gagliano
Michael R Barnes
Michael E Weale
Jo Knight
author_facet Sarah A Gagliano
Michael R Barnes
Michael E Weale
Jo Knight
author_sort Sarah A Gagliano
collection DOAJ
description The increasing quantity and quality of functional genomic information motivate the assessment and integration of these data with association data, including data originating from genome-wide association studies (GWAS). We used previously described GWAS signals ("hits") to train a regularized logistic model in order to predict SNP causality on the basis of a large multivariate functional dataset. We show how this model can be used to derive Bayes factors for integrating functional and association data into a combined Bayesian analysis. Functional characteristics were obtained from the Encyclopedia of DNA Elements (ENCODE), from published expression quantitative trait loci (eQTL), and from other sources of genome-wide characteristics. We trained the model using all GWAS signals combined, and also using phenotype specific signals for autoimmune, brain-related, cancer, and cardiovascular disorders. The non-phenotype specific and the autoimmune GWAS signals gave the most reliable results. We found SNPs with higher probabilities of causality from functional characteristics showed an enrichment of more significant p-values compared to all GWAS SNPs in three large GWAS studies of complex traits. We investigated the ability of our Bayesian method to improve the identification of true causal signals in a psoriasis GWAS dataset and found that combining functional data with association data improves the ability to prioritise novel hits. We used the predictions from the penalized logistic regression model to calculate Bayes factors relating to functional characteristics and supply these online alongside resources to integrate these data with association data.
format Article
id doaj-art-dea5727b86e64406a5dada1990ec3ec6
institution OA Journals
issn 1932-6203
language English
publishDate 2014-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-dea5727b86e64406a5dada1990ec3ec62025-08-20T02:34:13ZengPublic Library of Science (PLoS)PLoS ONE1932-62032014-01-0195e9812210.1371/journal.pone.0098122A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.Sarah A GaglianoMichael R BarnesMichael E WealeJo KnightThe increasing quantity and quality of functional genomic information motivate the assessment and integration of these data with association data, including data originating from genome-wide association studies (GWAS). We used previously described GWAS signals ("hits") to train a regularized logistic model in order to predict SNP causality on the basis of a large multivariate functional dataset. We show how this model can be used to derive Bayes factors for integrating functional and association data into a combined Bayesian analysis. Functional characteristics were obtained from the Encyclopedia of DNA Elements (ENCODE), from published expression quantitative trait loci (eQTL), and from other sources of genome-wide characteristics. We trained the model using all GWAS signals combined, and also using phenotype specific signals for autoimmune, brain-related, cancer, and cardiovascular disorders. The non-phenotype specific and the autoimmune GWAS signals gave the most reliable results. We found SNPs with higher probabilities of causality from functional characteristics showed an enrichment of more significant p-values compared to all GWAS SNPs in three large GWAS studies of complex traits. We investigated the ability of our Bayesian method to improve the identification of true causal signals in a psoriasis GWAS dataset and found that combining functional data with association data improves the ability to prioritise novel hits. We used the predictions from the penalized logistic regression model to calculate Bayes factors relating to functional characteristics and supply these online alongside resources to integrate these data with association data.https://doi.org/10.1371/journal.pone.0098122
spellingShingle Sarah A Gagliano
Michael R Barnes
Michael E Weale
Jo Knight
A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.
PLoS ONE
title A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.
title_full A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.
title_fullStr A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.
title_full_unstemmed A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.
title_short A Bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization.
title_sort bayesian method to incorporate hundreds of functional characteristics with association evidence to improve variant prioritization
url https://doi.org/10.1371/journal.pone.0098122
work_keys_str_mv AT sarahagagliano abayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization
AT michaelrbarnes abayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization
AT michaeleweale abayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization
AT joknight abayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization
AT sarahagagliano bayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization
AT michaelrbarnes bayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization
AT michaeleweale bayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization
AT joknight bayesianmethodtoincorporatehundredsoffunctionalcharacteristicswithassociationevidencetoimprovevariantprioritization