Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.

Ancient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informat...

Full description

Saved in:
Bibliographic Details
Main Authors: Andrej Kuritzin, Tabea Kischka, Jürgen Schmitz, Gennady Churakov
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2016-03-01
Series:PLoS Computational Biology
Online Access:https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1004812&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850189222086967296
author Andrej Kuritzin
Tabea Kischka
Jürgen Schmitz
Gennady Churakov
author_facet Andrej Kuritzin
Tabea Kischka
Jürgen Schmitz
Gennady Churakov
author_sort Andrej Kuritzin
collection DOAJ
description Ancient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informative insertion is that the inserted element was fixed in the ancestral population before speciation; if not, polymorphically inserted elements may lead to random distributions of presence/absence states during speciation and possibly to apparently conflicting reconstructions of their ancestry. Fortunately, such misleading fixed cases are relatively rare but nevertheless, need to be considered. Here, we present novel, comprehensive statistical models applicable for (1) analyzing any pattern of rare genomic changes, (2) testing and differentiating conflicting phylogenetic reconstructions based on rare genomic changes caused by incomplete lineage sorting or/and ancestral hybridization, and (3) differentiating between search strategies involving genome information from one or several lineages. When the new statistics are applied, in non-conflicting cases a minimum of three elements present in both of two species and absent in a third group are considered significant support (p<0.05) for the branching of the third from the other two, if all three of the given species are screened equally for genome or experimental data. Five elements are necessary for significant support (p<0.05) if a diagnostic locus derived from only one of three species is screened, and no conflicting markers are detected. Most potentially conflicting patterns can be evaluated for their significance and ancestral hybridization can be distinguished from incomplete lineage sorting by considering symmetric or asymmetric distribution of rare genomic changes among possible tree configurations. Additionally, we provide an R-application to make the new KKSC insertion significance test available for the scientific community at http://retrogenomics.uni-muenster.de:3838/KKSC_significance_test/.
format Article
id doaj-art-4e2c31c65d204708a7fede8560b1882c
institution OA Journals
issn 1553-734X
1553-7358
language English
publishDate 2016-03-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj-art-4e2c31c65d204708a7fede8560b1882c2025-08-20T02:15:40ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582016-03-01123e100481210.1371/journal.pcbi.1004812Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.Andrej KuritzinTabea KischkaJürgen SchmitzGennady ChurakovAncient retroposon insertions can be used as virtually homoplasy-free markers to reconstruct the phylogenetic history of species. Inherited, orthologous insertions in related species offer reliable signals of a common origin of the given species. One prerequisite for such a phylogenetically informative insertion is that the inserted element was fixed in the ancestral population before speciation; if not, polymorphically inserted elements may lead to random distributions of presence/absence states during speciation and possibly to apparently conflicting reconstructions of their ancestry. Fortunately, such misleading fixed cases are relatively rare but nevertheless, need to be considered. Here, we present novel, comprehensive statistical models applicable for (1) analyzing any pattern of rare genomic changes, (2) testing and differentiating conflicting phylogenetic reconstructions based on rare genomic changes caused by incomplete lineage sorting or/and ancestral hybridization, and (3) differentiating between search strategies involving genome information from one or several lineages. When the new statistics are applied, in non-conflicting cases a minimum of three elements present in both of two species and absent in a third group are considered significant support (p<0.05) for the branching of the third from the other two, if all three of the given species are screened equally for genome or experimental data. Five elements are necessary for significant support (p<0.05) if a diagnostic locus derived from only one of three species is screened, and no conflicting markers are detected. Most potentially conflicting patterns can be evaluated for their significance and ancestral hybridization can be distinguished from incomplete lineage sorting by considering symmetric or asymmetric distribution of rare genomic changes among possible tree configurations. Additionally, we provide an R-application to make the new KKSC insertion significance test available for the scientific community at http://retrogenomics.uni-muenster.de:3838/KKSC_significance_test/.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1004812&type=printable
spellingShingle Andrej Kuritzin
Tabea Kischka
Jürgen Schmitz
Gennady Churakov
Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.
PLoS Computational Biology
title Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.
title_full Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.
title_fullStr Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.
title_full_unstemmed Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.
title_short Incomplete Lineage Sorting and Hybridization Statistics for Large-Scale Retroposon Insertion Data.
title_sort incomplete lineage sorting and hybridization statistics for large scale retroposon insertion data
url https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1004812&type=printable
work_keys_str_mv AT andrejkuritzin incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata
AT tabeakischka incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata
AT jurgenschmitz incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata
AT gennadychurakov incompletelineagesortingandhybridizationstatisticsforlargescaleretroposoninsertiondata