A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.

AlphaFold2 and RoseTTAfold represent a transformative advance for predicting protein structure. They are able to make very high-quality predictions given a high-quality alignment of the protein sequence with related proteins. These predictions are now readily available via the AlphaFold database of...

Full description

Saved in:
Bibliographic Details
Main Author: Richard John Wheeler
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0259871&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850127341457506304
author Richard John Wheeler
author_facet Richard John Wheeler
author_sort Richard John Wheeler
collection DOAJ
description AlphaFold2 and RoseTTAfold represent a transformative advance for predicting protein structure. They are able to make very high-quality predictions given a high-quality alignment of the protein sequence with related proteins. These predictions are now readily available via the AlphaFold database of predicted structures and AlphaFold or RoseTTAfold Colaboratory notebooks for custom predictions. However, predictions for some species tend to be lower confidence than model organisms. Problematic species include Trypanosoma cruzi and Leishmania infantum: important unicellular eukaryotic human parasites in an early-branching eukaryotic lineage. The cause appears to be due to poor sampling of this branch of life (Discoba) in the protein sequences databases used for the AlphaFold database and ColabFold. Here, by comprehensively gathering openly available protein sequence data for Discoba species, significant improvements to AlphaFold2 protein structure prediction over the AlphaFold database and ColabFold are demonstrated. This is made available as an easy-to-use tool for the parasitology community in the form of Colaboratory notebooks for generating multiple sequence alignments and AlphaFold2 predictions of protein structure for Trypanosoma, Leishmania and related species.
format Article
id doaj-art-83b2109ced794341949d3298902a88ba
institution OA Journals
issn 1932-6203
language English
publishDate 2021-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-83b2109ced794341949d3298902a88ba2025-08-20T02:33:43ZengPublic Library of Science (PLoS)PLoS ONE1932-62032021-01-011611e025987110.1371/journal.pone.0259871A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.Richard John WheelerAlphaFold2 and RoseTTAfold represent a transformative advance for predicting protein structure. They are able to make very high-quality predictions given a high-quality alignment of the protein sequence with related proteins. These predictions are now readily available via the AlphaFold database of predicted structures and AlphaFold or RoseTTAfold Colaboratory notebooks for custom predictions. However, predictions for some species tend to be lower confidence than model organisms. Problematic species include Trypanosoma cruzi and Leishmania infantum: important unicellular eukaryotic human parasites in an early-branching eukaryotic lineage. The cause appears to be due to poor sampling of this branch of life (Discoba) in the protein sequences databases used for the AlphaFold database and ColabFold. Here, by comprehensively gathering openly available protein sequence data for Discoba species, significant improvements to AlphaFold2 protein structure prediction over the AlphaFold database and ColabFold are demonstrated. This is made available as an easy-to-use tool for the parasitology community in the form of Colaboratory notebooks for generating multiple sequence alignments and AlphaFold2 predictions of protein structure for Trypanosoma, Leishmania and related species.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0259871&type=printable
spellingShingle Richard John Wheeler
A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.
PLoS ONE
title A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.
title_full A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.
title_fullStr A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.
title_full_unstemmed A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.
title_short A resource for improved predictions of Trypanosoma and Leishmania protein three-dimensional structure.
title_sort resource for improved predictions of trypanosoma and leishmania protein three dimensional structure
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0259871&type=printable
work_keys_str_mv AT richardjohnwheeler aresourceforimprovedpredictionsoftrypanosomaandleishmaniaproteinthreedimensionalstructure
AT richardjohnwheeler resourceforimprovedpredictionsoftrypanosomaandleishmaniaproteinthreedimensionalstructure