Identification of nanomolar adenosine A2A receptor ligands using reinforcement learning and structure-based drug design

Bibliographic Details
Main Authors: Morgan Thomas, Pierre G. Matricon, Robert J. Gillespie, Maja Napiórkowska, Hannah Neale, Jonathan S. Mason, Jason Brown, Kaan Harwood, Charlotte Fieldhouse, Nigel A. Swain, Tian Geng, Noel M. O’Boyle, Francesca Deflorian, Andreas Bender, Chris de Graaf
Format: Article
Language: English
Published: Nature Portfolio 2025-07-01
Series: Nature Communications
Online Access: https://doi.org/10.1038/s41467-025-60629-0
Description
Summary: Generative chemical language models (CLMs) have demonstrated success in learning language-based molecular representations for de novo drug design. Here, we integrate structure-based drug design (SBDD) principles with CLMs to go from protein structure to novel small-molecule ligands, without a priori knowledge of ligand chemistry. Using Augmented Hill-Climb, we successfully optimise multiple objectives within a practical timeframe, including protein-ligand complementarity. Resulting de novo molecules contain known or promising adenosine A2A receptor ligand chemistry that is not available in commercial vendor libraries, accessing commercially novel areas of chemical space. Experimental validation demonstrates a binding hit rate of 88%, with 50% having confirmed functional activity, including three nanomolar ligands and two novel chemotypes. The two strongest binders are co-crystallised with the A2A receptor, revealing their binding mechanisms that can be used to inform future iterations of structure-based de novo design, closing the AI SBDD loop.
ISSN: 2041-1723