Mining the human phenome using allelic scores that index biological intermediates.

It is common practice in genome-wide association studies (GWAS) to focus on the relationship between disease risk and genetic variants one marker at a time. When relevant genes are identified it is often possible to implicate biological intermediates and pathways likely to be involved in disease aet...

Full description

Saved in:
Bibliographic Details
Main Authors: David M Evans, Marie Jo A Brion, Lavinia Paternoster, John P Kemp, George McMahon, Marcus Munafò, John B Whitfield, Sarah E Medland, Grant W Montgomery, GIANT Consortium, CRP Consortium, TAG Consortium, Nicholas J Timpson, Beate St Pourcain, Debbie A Lawlor, Nicholas G Martin, Abbas Dehghan, Joel Hirschhorn, George Davey Smith
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2013-10-01
Series:PLoS Genetics
Online Access:https://doi.org/10.1371/journal.pgen.1003919
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850161468144615424
author David M Evans
Marie Jo A Brion
Lavinia Paternoster
John P Kemp
George McMahon
Marcus Munafò
John B Whitfield
Sarah E Medland
Grant W Montgomery
GIANT Consortium
CRP Consortium
TAG Consortium
Nicholas J Timpson
Beate St Pourcain
Debbie A Lawlor
Nicholas G Martin
Abbas Dehghan
Joel Hirschhorn
George Davey Smith
author_facet David M Evans
Marie Jo A Brion
Lavinia Paternoster
John P Kemp
George McMahon
Marcus Munafò
John B Whitfield
Sarah E Medland
Grant W Montgomery
GIANT Consortium
CRP Consortium
TAG Consortium
Nicholas J Timpson
Beate St Pourcain
Debbie A Lawlor
Nicholas G Martin
Abbas Dehghan
Joel Hirschhorn
George Davey Smith
author_sort David M Evans
collection DOAJ
description It is common practice in genome-wide association studies (GWAS) to focus on the relationship between disease risk and genetic variants one marker at a time. When relevant genes are identified it is often possible to implicate biological intermediates and pathways likely to be involved in disease aetiology. However, single genetic variants typically explain small amounts of disease risk. Our idea is to construct allelic scores that explain greater proportions of the variance in biological intermediates, and subsequently use these scores to data mine GWAS. To investigate the approach's properties, we indexed three biological intermediates where the results of large GWAS meta-analyses were available: body mass index, C-reactive protein and low density lipoprotein levels. We generated allelic scores in the Avon Longitudinal Study of Parents and Children, and in publicly available data from the first Wellcome Trust Case Control Consortium. We compared the explanatory ability of allelic scores in terms of their capacity to proxy for the intermediate of interest, and the extent to which they associated with disease. We found that allelic scores derived from known variants and allelic scores derived from hundreds of thousands of genetic markers explained significant portions of the variance in biological intermediates of interest, and many of these scores showed expected correlations with disease. Genome-wide allelic scores however tended to lack specificity suggesting that they should be used with caution and perhaps only to proxy biological intermediates for which there are no known individual variants. Power calculations confirm the feasibility of extending our strategy to the analysis of tens of thousands of molecular phenotypes in large genome-wide meta-analyses. We conclude that our method represents a simple way in which potentially tens of thousands of molecular phenotypes could be screened for causal relationships with disease without having to expensively measure these variables in individual disease collections.
format Article
id doaj-art-c03a59d49387465eac5455262ccdf1bd
institution OA Journals
issn 1553-7390
1553-7404
language English
publishDate 2013-10-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj-art-c03a59d49387465eac5455262ccdf1bd2025-08-20T02:22:49ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042013-10-01910e100391910.1371/journal.pgen.1003919Mining the human phenome using allelic scores that index biological intermediates.David M EvansMarie Jo A BrionLavinia PaternosterJohn P KempGeorge McMahonMarcus MunafòJohn B WhitfieldSarah E MedlandGrant W MontgomeryGIANT ConsortiumCRP ConsortiumTAG ConsortiumNicholas J TimpsonBeate St PourcainDebbie A LawlorNicholas G MartinAbbas DehghanJoel HirschhornGeorge Davey SmithIt is common practice in genome-wide association studies (GWAS) to focus on the relationship between disease risk and genetic variants one marker at a time. When relevant genes are identified it is often possible to implicate biological intermediates and pathways likely to be involved in disease aetiology. However, single genetic variants typically explain small amounts of disease risk. Our idea is to construct allelic scores that explain greater proportions of the variance in biological intermediates, and subsequently use these scores to data mine GWAS. To investigate the approach's properties, we indexed three biological intermediates where the results of large GWAS meta-analyses were available: body mass index, C-reactive protein and low density lipoprotein levels. We generated allelic scores in the Avon Longitudinal Study of Parents and Children, and in publicly available data from the first Wellcome Trust Case Control Consortium. We compared the explanatory ability of allelic scores in terms of their capacity to proxy for the intermediate of interest, and the extent to which they associated with disease. We found that allelic scores derived from known variants and allelic scores derived from hundreds of thousands of genetic markers explained significant portions of the variance in biological intermediates of interest, and many of these scores showed expected correlations with disease. Genome-wide allelic scores however tended to lack specificity suggesting that they should be used with caution and perhaps only to proxy biological intermediates for which there are no known individual variants. Power calculations confirm the feasibility of extending our strategy to the analysis of tens of thousands of molecular phenotypes in large genome-wide meta-analyses. We conclude that our method represents a simple way in which potentially tens of thousands of molecular phenotypes could be screened for causal relationships with disease without having to expensively measure these variables in individual disease collections.https://doi.org/10.1371/journal.pgen.1003919
spellingShingle David M Evans
Marie Jo A Brion
Lavinia Paternoster
John P Kemp
George McMahon
Marcus Munafò
John B Whitfield
Sarah E Medland
Grant W Montgomery
GIANT Consortium
CRP Consortium
TAG Consortium
Nicholas J Timpson
Beate St Pourcain
Debbie A Lawlor
Nicholas G Martin
Abbas Dehghan
Joel Hirschhorn
George Davey Smith
Mining the human phenome using allelic scores that index biological intermediates.
PLoS Genetics
title Mining the human phenome using allelic scores that index biological intermediates.
title_full Mining the human phenome using allelic scores that index biological intermediates.
title_fullStr Mining the human phenome using allelic scores that index biological intermediates.
title_full_unstemmed Mining the human phenome using allelic scores that index biological intermediates.
title_short Mining the human phenome using allelic scores that index biological intermediates.
title_sort mining the human phenome using allelic scores that index biological intermediates
url https://doi.org/10.1371/journal.pgen.1003919
work_keys_str_mv AT davidmevans miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT mariejoabrion miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT laviniapaternoster miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT johnpkemp miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT georgemcmahon miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT marcusmunafo miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT johnbwhitfield miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT sarahemedland miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT grantwmontgomery miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT giantconsortium miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT crpconsortium miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT tagconsortium miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT nicholasjtimpson miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT beatestpourcain miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT debbiealawlor miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT nicholasgmartin miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT abbasdehghan miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT joelhirschhorn miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates
AT georgedaveysmith miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates