-
DUD-E and PDBBind
A new benchmark dataset for structure-based virtual screening, constructed from the DUD-E and PDBBind databases. -
CrossDocked 2022
The CrossDocked 2022 dataset supports structure-based virtual screening and is used for benchmarking docking methods. -
MOSES and HIV datasets
The MOSES dataset is a large dataset of molecules; the HIV dataset is a separate collection of 41,127 molecules. -
LIT-PCBA: An Unbiased Data Set for Machine Learning and Virtual Screening
The LIT-PCBA dataset contains experimentally confirmed active and inactive compounds for 15 targets. -
DrugCLIP: Contrastive Protein-Molecule Representation Learning for Virtual Sc...
Virtual screening, which identifies potential drugs from vast compound databases to bind with a particular protein pocket, is a critical step in AI-assisted drug discovery.... -
Ligand-Based Virtual Screening dataset
The paper uses the Ligand-Based Virtual Screening dataset, a large-scale dataset for virtual screening.