Mining the human phenome using allelic scores that index biological intermediates.
It is common practice in genome-wide association studies (GWAS) to focus on the relationship between disease risk and genetic variants one marker at a time. When relevant genes are identified it is often possible to implicate biological intermediates and pathways likely to be involved in disease aet...
Saved in:
| Main Authors: | , , , , , , , , , , , , , , , , , , |
|---|---|
| Format: | Article |
| Language: | English |
| Published: |
Public Library of Science (PLoS)
2013-10-01
|
| Series: | PLoS Genetics |
| Online Access: | https://doi.org/10.1371/journal.pgen.1003919 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1850161468144615424 |
|---|---|
| author | David M Evans Marie Jo A Brion Lavinia Paternoster John P Kemp George McMahon Marcus Munafò John B Whitfield Sarah E Medland Grant W Montgomery GIANT Consortium CRP Consortium TAG Consortium Nicholas J Timpson Beate St Pourcain Debbie A Lawlor Nicholas G Martin Abbas Dehghan Joel Hirschhorn George Davey Smith |
| author_facet | David M Evans Marie Jo A Brion Lavinia Paternoster John P Kemp George McMahon Marcus Munafò John B Whitfield Sarah E Medland Grant W Montgomery GIANT Consortium CRP Consortium TAG Consortium Nicholas J Timpson Beate St Pourcain Debbie A Lawlor Nicholas G Martin Abbas Dehghan Joel Hirschhorn George Davey Smith |
| author_sort | David M Evans |
| collection | DOAJ |
| description | It is common practice in genome-wide association studies (GWAS) to focus on the relationship between disease risk and genetic variants one marker at a time. When relevant genes are identified it is often possible to implicate biological intermediates and pathways likely to be involved in disease aetiology. However, single genetic variants typically explain small amounts of disease risk. Our idea is to construct allelic scores that explain greater proportions of the variance in biological intermediates, and subsequently use these scores to data mine GWAS. To investigate the approach's properties, we indexed three biological intermediates where the results of large GWAS meta-analyses were available: body mass index, C-reactive protein and low density lipoprotein levels. We generated allelic scores in the Avon Longitudinal Study of Parents and Children, and in publicly available data from the first Wellcome Trust Case Control Consortium. We compared the explanatory ability of allelic scores in terms of their capacity to proxy for the intermediate of interest, and the extent to which they associated with disease. We found that allelic scores derived from known variants and allelic scores derived from hundreds of thousands of genetic markers explained significant portions of the variance in biological intermediates of interest, and many of these scores showed expected correlations with disease. Genome-wide allelic scores however tended to lack specificity suggesting that they should be used with caution and perhaps only to proxy biological intermediates for which there are no known individual variants. Power calculations confirm the feasibility of extending our strategy to the analysis of tens of thousands of molecular phenotypes in large genome-wide meta-analyses. We conclude that our method represents a simple way in which potentially tens of thousands of molecular phenotypes could be screened for causal relationships with disease without having to expensively measure these variables in individual disease collections. |
| format | Article |
| id | doaj-art-c03a59d49387465eac5455262ccdf1bd |
| institution | OA Journals |
| issn | 1553-7390 1553-7404 |
| language | English |
| publishDate | 2013-10-01 |
| publisher | Public Library of Science (PLoS) |
| record_format | Article |
| series | PLoS Genetics |
| spelling | doaj-art-c03a59d49387465eac5455262ccdf1bd2025-08-20T02:22:49ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042013-10-01910e100391910.1371/journal.pgen.1003919Mining the human phenome using allelic scores that index biological intermediates.David M EvansMarie Jo A BrionLavinia PaternosterJohn P KempGeorge McMahonMarcus MunafòJohn B WhitfieldSarah E MedlandGrant W MontgomeryGIANT ConsortiumCRP ConsortiumTAG ConsortiumNicholas J TimpsonBeate St PourcainDebbie A LawlorNicholas G MartinAbbas DehghanJoel HirschhornGeorge Davey SmithIt is common practice in genome-wide association studies (GWAS) to focus on the relationship between disease risk and genetic variants one marker at a time. When relevant genes are identified it is often possible to implicate biological intermediates and pathways likely to be involved in disease aetiology. However, single genetic variants typically explain small amounts of disease risk. Our idea is to construct allelic scores that explain greater proportions of the variance in biological intermediates, and subsequently use these scores to data mine GWAS. To investigate the approach's properties, we indexed three biological intermediates where the results of large GWAS meta-analyses were available: body mass index, C-reactive protein and low density lipoprotein levels. We generated allelic scores in the Avon Longitudinal Study of Parents and Children, and in publicly available data from the first Wellcome Trust Case Control Consortium. We compared the explanatory ability of allelic scores in terms of their capacity to proxy for the intermediate of interest, and the extent to which they associated with disease. We found that allelic scores derived from known variants and allelic scores derived from hundreds of thousands of genetic markers explained significant portions of the variance in biological intermediates of interest, and many of these scores showed expected correlations with disease. Genome-wide allelic scores however tended to lack specificity suggesting that they should be used with caution and perhaps only to proxy biological intermediates for which there are no known individual variants. Power calculations confirm the feasibility of extending our strategy to the analysis of tens of thousands of molecular phenotypes in large genome-wide meta-analyses. We conclude that our method represents a simple way in which potentially tens of thousands of molecular phenotypes could be screened for causal relationships with disease without having to expensively measure these variables in individual disease collections.https://doi.org/10.1371/journal.pgen.1003919 |
| spellingShingle | David M Evans Marie Jo A Brion Lavinia Paternoster John P Kemp George McMahon Marcus Munafò John B Whitfield Sarah E Medland Grant W Montgomery GIANT Consortium CRP Consortium TAG Consortium Nicholas J Timpson Beate St Pourcain Debbie A Lawlor Nicholas G Martin Abbas Dehghan Joel Hirschhorn George Davey Smith Mining the human phenome using allelic scores that index biological intermediates. PLoS Genetics |
| title | Mining the human phenome using allelic scores that index biological intermediates. |
| title_full | Mining the human phenome using allelic scores that index biological intermediates. |
| title_fullStr | Mining the human phenome using allelic scores that index biological intermediates. |
| title_full_unstemmed | Mining the human phenome using allelic scores that index biological intermediates. |
| title_short | Mining the human phenome using allelic scores that index biological intermediates. |
| title_sort | mining the human phenome using allelic scores that index biological intermediates |
| url | https://doi.org/10.1371/journal.pgen.1003919 |
| work_keys_str_mv | AT davidmevans miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT mariejoabrion miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT laviniapaternoster miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT johnpkemp miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT georgemcmahon miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT marcusmunafo miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT johnbwhitfield miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT sarahemedland miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT grantwmontgomery miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT giantconsortium miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT crpconsortium miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT tagconsortium miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT nicholasjtimpson miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT beatestpourcain miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT debbiealawlor miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT nicholasgmartin miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT abbasdehghan miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT joelhirschhorn miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates AT georgedaveysmith miningthehumanphenomeusingallelicscoresthatindexbiologicalintermediates |