-
Scaffolds as classes over drug-like chemical space
The ScaffoldGraph dataset is a collection of scaffolds and molecules. -
Singlet Fission Dataset
The dataset used comprises of about 1000 screened singlet fission molecules from previous literature -
Molecular Sets (MOSES)
The MOSES dataset is a benchmarking platform for molecular generation models. It contains a large collection of molecules with their 3D conformers calculated by RDKit. -
Single Particle Tracking
Single-particle tracking dataset, containing trajectories of molecules, cells, or animals. -
GEOM-DRUGS
The GEOM-DRUGS dataset contains 27M conformations for 300k molecules, and it serves as a standard benchmark for the machine learning-based 3D molecular generators. -
MoleculeNet dataset
The MoleculeNet dataset is a benchmarking platform for molecular machine learning. -
MOSES and HIV datasets
The MOSES dataset is a large dataset of molecules, and the HIV dataset is a subset of the MOSES dataset containing 41127 molecules. -
CheMBL, BindingDB, ToxCat
A collection of molecules with known activity against various proteases -
Protease dataset
A dataset of protease dataset containing molecules active against various protease in enzymatic assays from experimental pharmacology databases -
CrossDocked2020
Structure-based drug design (SBDD), which can be formulated as generating 3D molecules conditioned on protein pockets, is an important and challenging task in drug discovery. -
AUTODIFF: Autoregressive Diffusion Modeling for Structure-based Drug Design
Structure-based drug design (SBDD), which aims to generate molecules that can bind tightly to the target protein, is an essential problem in drug discovery, and previous... -
QM7 dataset
The dataset used in this paper for regression task on atomization energies of molecules.