UMAP-based clustering split for rigorous evaluation of AI models for virtual screening on cancer cell lines*

Abstract: Virtual Screening (VS) of large compound libraries using Artificial Intelligence (AI) models is a highly effective approach for early drug discovery. Data splitting is crucial for benchmarking the performance of such AI models. Traditional random data splits often result in structurally sim...

Bibliographic Details
Main Authors: Qianrong Guo, Saiveth Hernandez-Hernandez, Pedro J. Ballester
Format: Article
Language: English
Published: BMC 2025-06-01
Series: Journal of Cheminformatics
Online Access: https://doi.org/10.1186/s13321-025-01039-8