Multi-locus analysis of genomic time series data from experimental evolution.

Genomic time series data generated by evolve-and-resequence (E&R) experiments offer a powerful window into the mechanisms that drive evolution. However, standard population genetic inference procedures do not account for sampling serially over time, and new methods are needed to make full use of...

Full description

Saved in:
Bibliographic Details
Main Authors: Jonathan Terhorst, Christian Schlötterer, Yun S Song
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2015-04-01
Series:PLoS Genetics
Online Access:https://doi.org/10.1371/journal.pgen.1005069
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850161719813341184
author Jonathan Terhorst
Christian Schlötterer
Yun S Song
author_facet Jonathan Terhorst
Christian Schlötterer
Yun S Song
author_sort Jonathan Terhorst
collection DOAJ
description Genomic time series data generated by evolve-and-resequence (E&R) experiments offer a powerful window into the mechanisms that drive evolution. However, standard population genetic inference procedures do not account for sampling serially over time, and new methods are needed to make full use of modern experimental evolution data. To address this problem, we develop a Gaussian process approximation to the multi-locus Wright-Fisher process with selection over a time course of tens of generations. The mean and covariance structure of the Gaussian process are obtained by computing the corresponding moments in discrete-time Wright-Fisher models conditioned on the presence of a linked selected site. This enables our method to account for the effects of linkage and selection, both along the genome and across sampled time points, in an approximate but principled manner. We first use simulated data to demonstrate the power of our method to correctly detect, locate and estimate the fitness of a selected allele from among several linked sites. We study how this power changes for different values of selection strength, initial haplotypic diversity, population size, sampling frequency, experimental duration, number of replicates, and sequencing coverage depth. In addition to providing quantitative estimates of selection parameters from experimental evolution data, our model can be used by practitioners to design E&R experiments with requisite power. We also explore how our likelihood-based approach can be used to infer other model parameters, including effective population size and recombination rate. Then, we apply our method to analyze genome-wide data from a real E&R experiment designed to study the adaptation of D. melanogaster to a new laboratory environment with alternating cold and hot temperatures.
format Article
id doaj-art-b33cb61bfc604026a6a3a2b0091ba404
institution OA Journals
issn 1553-7390
1553-7404
language English
publishDate 2015-04-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Genetics
spelling doaj-art-b33cb61bfc604026a6a3a2b0091ba4042025-08-20T02:22:45ZengPublic Library of Science (PLoS)PLoS Genetics1553-73901553-74042015-04-01114e100506910.1371/journal.pgen.1005069Multi-locus analysis of genomic time series data from experimental evolution.Jonathan TerhorstChristian SchlöttererYun S SongGenomic time series data generated by evolve-and-resequence (E&R) experiments offer a powerful window into the mechanisms that drive evolution. However, standard population genetic inference procedures do not account for sampling serially over time, and new methods are needed to make full use of modern experimental evolution data. To address this problem, we develop a Gaussian process approximation to the multi-locus Wright-Fisher process with selection over a time course of tens of generations. The mean and covariance structure of the Gaussian process are obtained by computing the corresponding moments in discrete-time Wright-Fisher models conditioned on the presence of a linked selected site. This enables our method to account for the effects of linkage and selection, both along the genome and across sampled time points, in an approximate but principled manner. We first use simulated data to demonstrate the power of our method to correctly detect, locate and estimate the fitness of a selected allele from among several linked sites. We study how this power changes for different values of selection strength, initial haplotypic diversity, population size, sampling frequency, experimental duration, number of replicates, and sequencing coverage depth. In addition to providing quantitative estimates of selection parameters from experimental evolution data, our model can be used by practitioners to design E&R experiments with requisite power. We also explore how our likelihood-based approach can be used to infer other model parameters, including effective population size and recombination rate. Then, we apply our method to analyze genome-wide data from a real E&R experiment designed to study the adaptation of D. melanogaster to a new laboratory environment with alternating cold and hot temperatures.https://doi.org/10.1371/journal.pgen.1005069
spellingShingle Jonathan Terhorst
Christian Schlötterer
Yun S Song
Multi-locus analysis of genomic time series data from experimental evolution.
PLoS Genetics
title Multi-locus analysis of genomic time series data from experimental evolution.
title_full Multi-locus analysis of genomic time series data from experimental evolution.
title_fullStr Multi-locus analysis of genomic time series data from experimental evolution.
title_full_unstemmed Multi-locus analysis of genomic time series data from experimental evolution.
title_short Multi-locus analysis of genomic time series data from experimental evolution.
title_sort multi locus analysis of genomic time series data from experimental evolution
url https://doi.org/10.1371/journal.pgen.1005069
work_keys_str_mv AT jonathanterhorst multilocusanalysisofgenomictimeseriesdatafromexperimentalevolution
AT christianschlotterer multilocusanalysisofgenomictimeseriesdatafromexperimentalevolution
AT yunssong multilocusanalysisofgenomictimeseriesdatafromexperimentalevolution