A Multi‐Task Self‐Supervised Strategy for Predicting Molecular Properties and FGFR1 Inhibitors

Abstract Studying the molecular properties of drugs and their interactions with human targets aids in better understanding the clinical performance of drugs and guides drug development. In computer‐aided drug discovery, it is crucial to utilize effective molecular feature representations for predict...

Full description

Saved in:
Bibliographic Details
Main Authors: Xin Yang, Yang Wang, Ye Lin, Mingxuan Zhang, Ou Liu, Jianwei Shuai, Qi Zhao
Format: Article
Language:English
Published: Wiley 2025-04-01
Series:Advanced Science
Subjects:
Online Access:https://doi.org/10.1002/advs.202412987
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Studying the molecular properties of drugs and their interactions with human targets aids in better understanding the clinical performance of drugs and guides drug development. In computer‐aided drug discovery, it is crucial to utilize effective molecular feature representations for predicting molecular properties and designing ligands with high binding affinity to targets. However, designing an effective multi‐task and self‐supervised strategy remains a significant challenge for the pretraining framework. In this study, a multi‐task self‐supervised deep learning framework is proposed, MTSSMol, which utilizes ≈10 million unlabeled drug‐like molecules for pretraining to identify potential inhibitors of fibroblast growth factor receptor 1 (FGFR1). During the pretraining of MTSSMol, molecular representations are learned through a graph neural networks (GNNs) encoder. A multi‐task self‐supervised pretraining strategy is proposed to fully capture the structural and chemical knowledge of molecules. Extensive computational tests on 27 datasets demonstrate that MTSSMol exhibits exceptional performance in predicting molecular properties across different domains. Moreover, MTSSMol's capability is validated to identify potential inhibitors of FGFR1 through molecular docking using RoseTTAFold All‐Atom (RFAA) and molecular dynamics simulations. Overall, MTSSMol provides an effective algorithmic framework for enhancing molecular representation learning and identifying potential drug candidates, offering a valuable tool to accelerate drug discovery processes. All of the codes are freely available online at https:// github.com/zhaoqi106/MTSSMol.
ISSN:2198-3844