FLORA: a novel method to predict protein function from structure in diverse superfamilies.

Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to...

Full description

Saved in:
Bibliographic Details
Main Authors: Oliver C Redfern, Benoît H Dessailly, Timothy J Dallman, Ian Sillitoe, Christine A Orengo
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2009-08-01
Series:PLoS Computational Biology
Online Access:https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000485&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850131946253844480
author Oliver C Redfern
Benoît H Dessailly
Timothy J Dallman
Ian Sillitoe
Christine A Orengo
author_facet Oliver C Redfern
Benoît H Dessailly
Timothy J Dallman
Ian Sillitoe
Christine A Orengo
author_sort Oliver C Redfern
collection DOAJ
description Predicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2-3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (alpha, beta, alphabeta) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues.
format Article
id doaj-art-c195f611f8764ff080c12c4f6af5ce76
institution OA Journals
issn 1553-734X
1553-7358
language English
publishDate 2009-08-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS Computational Biology
spelling doaj-art-c195f611f8764ff080c12c4f6af5ce762025-08-20T02:32:19ZengPublic Library of Science (PLoS)PLoS Computational Biology1553-734X1553-73582009-08-0158e100048510.1371/journal.pcbi.1000485FLORA: a novel method to predict protein function from structure in diverse superfamilies.Oliver C RedfernBenoît H DessaillyTimothy J DallmanIan SillitoeChristine A OrengoPredicting protein function from structure remains an active area of interest, particularly for the structural genomics initiatives where a substantial number of structures are initially solved with little or no functional characterisation. Although global structure comparison methods can be used to transfer functional annotations, the relationship between fold and function is complex, particularly in functionally diverse superfamilies that have evolved through different secondary structure embellishments to a common structural core. The majority of prediction algorithms employ local templates built on known or predicted functional residues. Here, we present a novel method (FLORA) that automatically generates structural motifs associated with different functional sub-families (FSGs) within functionally diverse domain superfamilies. Templates are created purely on the basis of their specificity for a given FSG, and the method makes no prior prediction of functional sites, nor assumes specific physico-chemical properties of residues. FLORA is able to accurately discriminate between homologous domains with different functions and substantially outperforms (a 2-3 fold increase in coverage at low error rates) popular structure comparison methods and a leading function prediction method. We benchmark FLORA on a large data set of enzyme superfamilies from all three major protein classes (alpha, beta, alphabeta) and demonstrate the functional relevance of the motifs it identifies. We also provide novel predictions of enzymatic activity for a large number of structures solved by the Protein Structure Initiative. Overall, we show that FLORA is able to effectively detect functionally similar protein domain structures by purely using patterns of structural conservation of all residues.https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000485&type=printable
spellingShingle Oliver C Redfern
Benoît H Dessailly
Timothy J Dallman
Ian Sillitoe
Christine A Orengo
FLORA: a novel method to predict protein function from structure in diverse superfamilies.
PLoS Computational Biology
title FLORA: a novel method to predict protein function from structure in diverse superfamilies.
title_full FLORA: a novel method to predict protein function from structure in diverse superfamilies.
title_fullStr FLORA: a novel method to predict protein function from structure in diverse superfamilies.
title_full_unstemmed FLORA: a novel method to predict protein function from structure in diverse superfamilies.
title_short FLORA: a novel method to predict protein function from structure in diverse superfamilies.
title_sort flora a novel method to predict protein function from structure in diverse superfamilies
url https://journals.plos.org/ploscompbiol/article/file?id=10.1371/journal.pcbi.1000485&type=printable
work_keys_str_mv AT olivercredfern floraanovelmethodtopredictproteinfunctionfromstructureindiversesuperfamilies
AT benoithdessailly floraanovelmethodtopredictproteinfunctionfromstructureindiversesuperfamilies
AT timothyjdallman floraanovelmethodtopredictproteinfunctionfromstructureindiversesuperfamilies
AT iansillitoe floraanovelmethodtopredictproteinfunctionfromstructureindiversesuperfamilies
AT christineaorengo floraanovelmethodtopredictproteinfunctionfromstructureindiversesuperfamilies