Integration of pre-trained protein language models with equivariant graph neural networks for peptide toxicity prediction

Abstract Background Peptide-based therapeutics have great potential due to their versatility, high specificity, and suitability for a variety of therapeutic applications. Despite these advantages, the inherent toxicities of some peptides pose challenges in drug development. Several computational met...

Full description

Saved in:
Bibliographic Details
Main Authors: Shihu Jiao, Xiucai Ye, Tetsuya Sakurai, Quan Zou, Wu Han, Chao Zhan
Format: Article
Language:English
Published: BMC 2025-07-01
Series:BMC Biology
Subjects:
Online Access:https://doi.org/10.1186/s12915-025-02329-1
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract Background Peptide-based therapeutics have great potential due to their versatility, high specificity, and suitability for a variety of therapeutic applications. Despite these advantages, the inherent toxicities of some peptides pose challenges in drug development. Several computational methods have been developed to allow rapid and efficient large-scale screening of peptide toxicity. However, these methods mainly rely on the primary sequence and often ignore critical structural information, which limits their predictive accuracy. Results In this study, we introduce a novel framework named StrucToxNet that integrates a pre-trained protein language model with an equivariant graph neural network to improve peptide toxicity prediction. By combining sequence embeddings from the ProtT5 language model and 3D structural data predicted by ESMFold, StrucToxNet can capture both sequential and spatial characteristics of peptides. Testing on the independent dataset indicates that StrucToxNet outperforms existing sequence-based models in various metrics, achieving higher balanced accuracy and overall performance. Conclusions The results demonstrate the robustness and generalizability of StrucToxNet, marking it a reliable tool in the computational screening of toxic peptides and facilitating safer peptide-based drug development.
ISSN:1741-7007