Compound dataset and custom code for deep generative multi-target compound design

Aim: Generating a data and software infrastructure for evaluating multi-target compound (MT-CPD) design via deep generative modeling. Methodology: The REINVENT 2.0 approach for generative modeling was extended for MT-CPD design and a large benchmark data set was curated. Exemplary results & data...

Full description

Saved in:
Bibliographic Details
Main Authors: Thomas Blaschke, Jürgen Bajorath
Format: Article
Language:English
Published: Taylor & Francis Group 2021-07-01
Series:Future Science OA
Subjects:
Online Access:https://www.future-science.com/doi/10.2144/fsoa-2021-0033
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Aim: Generating a data and software infrastructure for evaluating multi-target compound (MT-CPD) design via deep generative modeling. Methodology: The REINVENT 2.0 approach for generative modeling was extended for MT-CPD design and a large benchmark data set was curated. Exemplary results & data: Proof-of-concept for deep generative MT-CPD design was established. Custom code and the benchmark set comprising 2809 MT-CPDs, 61,928 single-target and 295,395 inactive compounds from biological screens are made freely available. Limitations & next steps: MT-CPD design via deep learning is still at its conceptual stages. It will be required to demonstrate experimental impact. The data and software we provide enable further investigation of MT-CPD design and generation of candidate molecules for experimental programs.
ISSN:2056-5623