The dataset used in this study consists of 1.6M chemical molecules from the MOSES benchmarking dataset, 211k ligand molecules from the BindingDB database, and 12 in vitro...
The ZINC250K dataset is a large dataset of molecules used for molecular design and generation. It contains 250,000 molecules with their corresponding properties and structures.