Generalizable, fast, and accurate DeepQSPR with fastprop

Abstract Quantitative Structure–Property Relationship studies (QSPR), often referred to interchangeably as QSAR, seek to establish a mapping between molecular structure and an arbitrary target property. Historically this was done on a target-by-target basis with new descriptors being devised to spec...

Full description

Saved in:
Bibliographic Details
Main Authors: Jackson W. Burns, William H. Green
Format: Article
Language:English
Published: BMC 2025-05-01
Series:Journal of Cheminformatics
Subjects:
Online Access:https://doi.org/10.1186/s13321-025-01013-4
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849725267177635840
author Jackson W. Burns
William H. Green
author_facet Jackson W. Burns
William H. Green
author_sort Jackson W. Burns
collection DOAJ
description Abstract Quantitative Structure–Property Relationship studies (QSPR), often referred to interchangeably as QSAR, seek to establish a mapping between molecular structure and an arbitrary target property. Historically this was done on a target-by-target basis with new descriptors being devised to specifically map to a given target. Today software packages exist that calculate thousands of these descriptors, enabling general modeling typically with classical and machine learning methods. Also present today are learned representation methods in which deep learning models generate a target-specific representation during training. The former requires less training data and offers improved speed and interpretability while the latter offers excellent generality, while the intersection of the two remains under-explored. This paper introduces fastprop, a software package and general Deep-QSPR framework that combines a cogent set of molecular descriptors with deep learning to achieve state-of-the-art performance on datasets ranging from tens to tens of thousands of molecules. fastprop provides both a user-friendly Command Line Interface and highly interoperable set of Python modules for the training and deployment of feedforward neural networks for property prediction. This approach yields improvements in speed and interpretability over existing methods while statistically equaling or exceeding their performance across most of the tested benchmarks. fastprop is designed with Research Software Engineering best practices and is free and open source, hosted at github.com/jacksonburns/fastprop.
format Article
id doaj-art-47901b5f0f2a43b28e1d438759638f2c
institution DOAJ
issn 1758-2946
language English
publishDate 2025-05-01
publisher BMC
record_format Article
series Journal of Cheminformatics
spelling doaj-art-47901b5f0f2a43b28e1d438759638f2c2025-08-20T03:10:31ZengBMCJournal of Cheminformatics1758-29462025-05-0117111110.1186/s13321-025-01013-4Generalizable, fast, and accurate DeepQSPR with fastpropJackson W. Burns0William H. Green1Massachusetts Institute of TechnologyMassachusetts Institute of TechnologyAbstract Quantitative Structure–Property Relationship studies (QSPR), often referred to interchangeably as QSAR, seek to establish a mapping between molecular structure and an arbitrary target property. Historically this was done on a target-by-target basis with new descriptors being devised to specifically map to a given target. Today software packages exist that calculate thousands of these descriptors, enabling general modeling typically with classical and machine learning methods. Also present today are learned representation methods in which deep learning models generate a target-specific representation during training. The former requires less training data and offers improved speed and interpretability while the latter offers excellent generality, while the intersection of the two remains under-explored. This paper introduces fastprop, a software package and general Deep-QSPR framework that combines a cogent set of molecular descriptors with deep learning to achieve state-of-the-art performance on datasets ranging from tens to tens of thousands of molecules. fastprop provides both a user-friendly Command Line Interface and highly interoperable set of Python modules for the training and deployment of feedforward neural networks for property prediction. This approach yields improvements in speed and interpretability over existing methods while statistically equaling or exceeding their performance across most of the tested benchmarks. fastprop is designed with Research Software Engineering best practices and is free and open source, hosted at github.com/jacksonburns/fastprop.https://doi.org/10.1186/s13321-025-01013-4QSPRLearned representationsDeep learningMolecular descriptors
spellingShingle Jackson W. Burns
William H. Green
Generalizable, fast, and accurate DeepQSPR with fastprop
Journal of Cheminformatics
QSPR
Learned representations
Deep learning
Molecular descriptors
title Generalizable, fast, and accurate DeepQSPR with fastprop
title_full Generalizable, fast, and accurate DeepQSPR with fastprop
title_fullStr Generalizable, fast, and accurate DeepQSPR with fastprop
title_full_unstemmed Generalizable, fast, and accurate DeepQSPR with fastprop
title_short Generalizable, fast, and accurate DeepQSPR with fastprop
title_sort generalizable fast and accurate deepqspr with fastprop
topic QSPR
Learned representations
Deep learning
Molecular descriptors
url https://doi.org/10.1186/s13321-025-01013-4
work_keys_str_mv AT jacksonwburns generalizablefastandaccuratedeepqsprwithfastprop
AT williamhgreen generalizablefastandaccuratedeepqsprwithfastprop