Novel EDGE encoding method enhances ability to identify genetic interactions.

Assumptions are made about the genetic model of single nucleotide polymorphisms (SNPs) when choosing a traditional genetic encoding: additive, dominant, and recessive. Furthermore, SNPs across the genome are unlikely to demonstrate identical genetic models. However, running SNP-SNP interaction analy...

Full description

Saved in:
Bibliographic Details
Main Authors: Molly A Hall, John Wallace, Anastasia M Lucas, Yuki Bradford, Shefali S Verma, Bertram Müller-Myhsok, Kristin Passero, Jiayan Zhou, John McGuigan, Beibei Jiang, Sarah A Pendergrass, Yanfei Zhang, Peggy Peissig, Murray Brilliant, Patrick Sleiman, Hakon Hakonarson, John B Harley, Krzysztof Kiryluk, Kristel Van Steen, Jason H Moore, Marylyn D Ritchie
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-06-01
Series:PLoS Genetics
Online Access:https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1009534&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849337137648893952
author Molly A Hall
John Wallace
Anastasia M Lucas
Yuki Bradford
Shefali S Verma
Bertram Müller-Myhsok
Kristin Passero
Jiayan Zhou
John McGuigan
Beibei Jiang
Sarah A Pendergrass
Yanfei Zhang
Peggy Peissig
Murray Brilliant
Patrick Sleiman
Hakon Hakonarson
John B Harley
Krzysztof Kiryluk
Kristel Van Steen
Jason H Moore
Marylyn D Ritchie
author_facet Molly A Hall
John Wallace
Anastasia M Lucas
Yuki Bradford
Shefali S Verma
Bertram Müller-Myhsok
Kristin Passero
Jiayan Zhou
John McGuigan
Beibei Jiang
Sarah A Pendergrass
Yanfei Zhang
Peggy Peissig
Murray Brilliant
Patrick Sleiman
Hakon Hakonarson
John B Harley
Krzysztof Kiryluk
Kristel Van Steen
Jason H Moore
Marylyn D Ritchie
author_sort Molly A Hall
collection DOAJ
description Assumptions are made about the genetic model of single nucleotide polymorphisms (SNPs) when choosing a traditional genetic encoding: additive, dominant, and recessive. Furthermore, SNPs across the genome are unlikely to demonstrate identical genetic models. However, running SNP-SNP interaction analyses with every combination of encodings raises the multiple testing burden. Here, we present a novel and flexible encoding for genetic interactions, the elastic data-driven genetic encoding (EDGE), in which SNPs are assigned a heterozygous value based on the genetic model they demonstrate in a dataset prior to interaction testing. We assessed the power of EDGE to detect genetic interactions using 29 combinations of simulated genetic models and found it outperformed the traditional encoding methods across 10%, 30%, and 50% minor allele frequencies (MAFs). Further, EDGE maintained a low false-positive rate, while additive and dominant encodings demonstrated inflation. We evaluated EDGE and the traditional encodings with genetic data from the Electronic Medical Records and Genomics (eMERGE) Network for five phenotypes: age-related macular degeneration (AMD), age-related cataract, glaucoma, type 2 diabetes (T2D), and resistant hypertension. A multi-encoding genome-wide association study (GWAS) for each phenotype was performed using the traditional encodings, and the top results of the multi-encoding GWAS were considered for SNP-SNP interaction using the traditional encodings and EDGE. EDGE identified a novel SNP-SNP interaction for age-related cataract that no other method identified: rs7787286 (MAF: 0.041; intergenic region of chromosome 7)-rs4695885 (MAF: 0.34; intergenic region of chromosome 4) with a Bonferroni LRT p of 0.018. A SNP-SNP interaction was found in data from the UK Biobank within 25 kb of these SNPs using the recessive encoding: rs60374751 (MAF: 0.030) and rs6843594 (MAF: 0.34) (Bonferroni LRT p: 0.026). We recommend using EDGE to flexibly detect interactions between SNPs exhibiting diverse action.
format Article
id doaj-art-b37fdfc15813491fa1c98bc3372f6d22
institution Kabale University
issn 1553-7390
1553-7404
language English
publishDate 2021-06-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj-art-b37fdfc15813491fa1c98bc3372f6d222025-08-20T03:44:46ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042021-06-01176e100953410.1371/journal.pgen.1009534Novel EDGE encoding method enhances ability to identify genetic interactions.Molly A HallJohn WallaceAnastasia M LucasYuki BradfordShefali S VermaBertram Müller-MyhsokKristin PasseroJiayan ZhouJohn McGuiganBeibei JiangSarah A PendergrassYanfei ZhangPeggy PeissigMurray BrilliantPatrick SleimanHakon HakonarsonJohn B HarleyKrzysztof KirylukKristel Van SteenJason H MooreMarylyn D RitchieAssumptions are made about the genetic model of single nucleotide polymorphisms (SNPs) when choosing a traditional genetic encoding: additive, dominant, and recessive. Furthermore, SNPs across the genome are unlikely to demonstrate identical genetic models. However, running SNP-SNP interaction analyses with every combination of encodings raises the multiple testing burden. Here, we present a novel and flexible encoding for genetic interactions, the elastic data-driven genetic encoding (EDGE), in which SNPs are assigned a heterozygous value based on the genetic model they demonstrate in a dataset prior to interaction testing. We assessed the power of EDGE to detect genetic interactions using 29 combinations of simulated genetic models and found it outperformed the traditional encoding methods across 10%, 30%, and 50% minor allele frequencies (MAFs). Further, EDGE maintained a low false-positive rate, while additive and dominant encodings demonstrated inflation. We evaluated EDGE and the traditional encodings with genetic data from the Electronic Medical Records and Genomics (eMERGE) Network for five phenotypes: age-related macular degeneration (AMD), age-related cataract, glaucoma, type 2 diabetes (T2D), and resistant hypertension. A multi-encoding genome-wide association study (GWAS) for each phenotype was performed using the traditional encodings, and the top results of the multi-encoding GWAS were considered for SNP-SNP interaction using the traditional encodings and EDGE. EDGE identified a novel SNP-SNP interaction for age-related cataract that no other method identified: rs7787286 (MAF: 0.041; intergenic region of chromosome 7)-rs4695885 (MAF: 0.34; intergenic region of chromosome 4) with a Bonferroni LRT p of 0.018. A SNP-SNP interaction was found in data from the UK Biobank within 25 kb of these SNPs using the recessive encoding: rs60374751 (MAF: 0.030) and rs6843594 (MAF: 0.34) (Bonferroni LRT p: 0.026). We recommend using EDGE to flexibly detect interactions between SNPs exhibiting diverse action.https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1009534&type=printable
spellingShingle Molly A Hall
John Wallace
Anastasia M Lucas
Yuki Bradford
Shefali S Verma
Bertram Müller-Myhsok
Kristin Passero
Jiayan Zhou
John McGuigan
Beibei Jiang
Sarah A Pendergrass
Yanfei Zhang
Peggy Peissig
Murray Brilliant
Patrick Sleiman
Hakon Hakonarson
John B Harley
Krzysztof Kiryluk
Kristel Van Steen
Jason H Moore
Marylyn D Ritchie
Novel EDGE encoding method enhances ability to identify genetic interactions.
PLoS Genetics
title Novel EDGE encoding method enhances ability to identify genetic interactions.
title_full Novel EDGE encoding method enhances ability to identify genetic interactions.
title_fullStr Novel EDGE encoding method enhances ability to identify genetic interactions.
title_full_unstemmed Novel EDGE encoding method enhances ability to identify genetic interactions.
title_short Novel EDGE encoding method enhances ability to identify genetic interactions.
title_sort novel edge encoding method enhances ability to identify genetic interactions
url https://journals.plos.org/plosgenetics/article/file?id=10.1371/journal.pgen.1009534&type=printable
work_keys_str_mv AT mollyahall noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT johnwallace noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT anastasiamlucas noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT yukibradford noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT shefalisverma noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT bertrammullermyhsok noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT kristinpassero noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT jiayanzhou noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT johnmcguigan noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT beibeijiang noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT sarahapendergrass noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT yanfeizhang noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT peggypeissig noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT murraybrilliant noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT patricksleiman noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT hakonhakonarson noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT johnbharley noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT krzysztofkiryluk noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT kristelvansteen noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT jasonhmoore noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions
AT marylyndritchie noveledgeencodingmethodenhancesabilitytoidentifygeneticinteractions