Large-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexes

Abstract Tunnels in enzymes with buried active sites are key structural features allowing the entry of substrates and the release of products, thus contributing to the catalytic efficiency. Targeting the bottlenecks of protein tunnels is also a powerful protein engineering strategy. However, the ide...

Full description

Saved in:
Bibliographic Details
Main Authors: O. Vavra, J. Tyzack, F. Haddadi, J. Stourac, J. Damborsky, S. Mazurenko, J. M. Thornton, D. Bednar
Format: Article
Language:English
Published: BMC 2024-10-01
Series:Journal of Cheminformatics
Subjects:
Online Access:https://doi.org/10.1186/s13321-024-00907-z
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850275713991573504
author O. Vavra
J. Tyzack
F. Haddadi
J. Stourac
J. Damborsky
S. Mazurenko
J. M. Thornton
D. Bednar
author_facet O. Vavra
J. Tyzack
F. Haddadi
J. Stourac
J. Damborsky
S. Mazurenko
J. M. Thornton
D. Bednar
author_sort O. Vavra
collection DOAJ
description Abstract Tunnels in enzymes with buried active sites are key structural features allowing the entry of substrates and the release of products, thus contributing to the catalytic efficiency. Targeting the bottlenecks of protein tunnels is also a powerful protein engineering strategy. However, the identification of functional tunnels in multiple protein structures is a non-trivial task that can only be addressed computationally. We present a pipeline integrating automated structural analysis with an in-house machine-learning predictor for the annotation of protein pockets, followed by the calculation of the energetics of ligand transport via biochemically relevant tunnels. A thorough validation using eight distinct molecular systems revealed that CaverDock analysis of ligand un/binding is on par with time-consuming molecular dynamics simulations, but much faster. The optimized and validated pipeline was applied to annotate more than 17,000 cognate enzyme–ligand complexes. Analysis of ligand un/binding energetics indicates that the top priority tunnel has the most favourable energies in 75% of cases. Moreover, energy profiles of cognate ligands revealed that a simple geometry analysis can correctly identify tunnel bottlenecks only in 50% of cases. Our study provides essential information for the interpretation of results from tunnel calculation and energy profiling in mechanistic enzymology and protein engineering. We formulated several simple rules allowing identification of biochemically relevant tunnels based on the binding pockets, tunnel geometry, and ligand transport energy profiles. Scientific contributions The pipeline introduced in this work allows for the detailed analysis of a large set of protein–ligand complexes, focusing on transport pathways. We are introducing a novel predictor for determining the relevance of binding pockets for tunnel calculation. For the first time in the field, we present a high-throughput energetic analysis of ligand binding and unbinding, showing that approximate methods for these simulations can identify additional mutagenesis hotspots in enzymes compared to purely geometrical methods. The predictor is included in the supplementary material and can also be accessed at https://github.com/Faranehhad/Large-Scale-Pocket-Tunnel-Annotation.git . The tunnel data calculated in this study has been made publicly available as part of the ChannelsDB 2.0 database, accessible at https://channelsdb2.biodata.ceitec.cz/ .
format Article
id doaj-art-20fa0a24595e4051a8dec4e89a0250d6
institution OA Journals
issn 1758-2946
language English
publishDate 2024-10-01
publisher BMC
record_format Article
series Journal of Cheminformatics
spelling doaj-art-20fa0a24595e4051a8dec4e89a0250d62025-08-20T01:50:38ZengBMCJournal of Cheminformatics1758-29462024-10-0116111410.1186/s13321-024-00907-zLarge-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexesO. Vavra0J. Tyzack1F. Haddadi2J. Stourac3J. Damborsky4S. Mazurenko5J. M. Thornton6D. Bednar7Loschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk UniversityEuropean Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust GenomeCampusLoschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk UniversityLoschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk UniversityLoschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk UniversityLoschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk UniversityEuropean Molecular Biology Laboratory, European Bioinformatics Institute (EMBL-EBI), Wellcome Trust GenomeCampusLoschmidt Laboratories, Department of Experimental Biology and RECETOX, Faculty of Science, Masaryk UniversityAbstract Tunnels in enzymes with buried active sites are key structural features allowing the entry of substrates and the release of products, thus contributing to the catalytic efficiency. Targeting the bottlenecks of protein tunnels is also a powerful protein engineering strategy. However, the identification of functional tunnels in multiple protein structures is a non-trivial task that can only be addressed computationally. We present a pipeline integrating automated structural analysis with an in-house machine-learning predictor for the annotation of protein pockets, followed by the calculation of the energetics of ligand transport via biochemically relevant tunnels. A thorough validation using eight distinct molecular systems revealed that CaverDock analysis of ligand un/binding is on par with time-consuming molecular dynamics simulations, but much faster. The optimized and validated pipeline was applied to annotate more than 17,000 cognate enzyme–ligand complexes. Analysis of ligand un/binding energetics indicates that the top priority tunnel has the most favourable energies in 75% of cases. Moreover, energy profiles of cognate ligands revealed that a simple geometry analysis can correctly identify tunnel bottlenecks only in 50% of cases. Our study provides essential information for the interpretation of results from tunnel calculation and energy profiling in mechanistic enzymology and protein engineering. We formulated several simple rules allowing identification of biochemically relevant tunnels based on the binding pockets, tunnel geometry, and ligand transport energy profiles. Scientific contributions The pipeline introduced in this work allows for the detailed analysis of a large set of protein–ligand complexes, focusing on transport pathways. We are introducing a novel predictor for determining the relevance of binding pockets for tunnel calculation. For the first time in the field, we present a high-throughput energetic analysis of ligand binding and unbinding, showing that approximate methods for these simulations can identify additional mutagenesis hotspots in enzymes compared to purely geometrical methods. The predictor is included in the supplementary material and can also be accessed at https://github.com/Faranehhad/Large-Scale-Pocket-Tunnel-Annotation.git . The tunnel data calculated in this study has been made publicly available as part of the ChannelsDB 2.0 database, accessible at https://channelsdb2.biodata.ceitec.cz/ .https://doi.org/10.1186/s13321-024-00907-zBottleneckCognate ligandCavityEnzymeTunnelMachine learning
spellingShingle O. Vavra
J. Tyzack
F. Haddadi
J. Stourac
J. Damborsky
S. Mazurenko
J. M. Thornton
D. Bednar
Large-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexes
Journal of Cheminformatics
Bottleneck
Cognate ligand
Cavity
Enzyme
Tunnel
Machine learning
title Large-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexes
title_full Large-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexes
title_fullStr Large-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexes
title_full_unstemmed Large-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexes
title_short Large-scale annotation of biochemically relevant pockets and tunnels in cognate enzyme–ligand complexes
title_sort large scale annotation of biochemically relevant pockets and tunnels in cognate enzyme ligand complexes
topic Bottleneck
Cognate ligand
Cavity
Enzyme
Tunnel
Machine learning
url https://doi.org/10.1186/s13321-024-00907-z
work_keys_str_mv AT ovavra largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes
AT jtyzack largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes
AT fhaddadi largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes
AT jstourac largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes
AT jdamborsky largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes
AT smazurenko largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes
AT jmthornton largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes
AT dbednar largescaleannotationofbiochemicallyrelevantpocketsandtunnelsincognateenzymeligandcomplexes