Chemical Database - Groups

Mcule database

The Mcule database is a collection of purchasable compounds with known stock amounts. The network has 6,369,219 nodes and 8,808,841 edges.

ChEMBL Dataset

The dataset consists of all small molecules from the ChEMBL database with a length of 500 tokens or fewer.
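
As a rough illustration of the length filter described above, the following sketch keeps only molecules whose tokenized form has at most 500 tokens. The SMILES representation, the regex-based `tokenize` helper, and the `MAX_TOKENS` name are assumptions made for illustration; the description does not state which string format or tokenizer was used.

```python
import re

# Hypothetical tokenizer (an assumption, not the dataset's actual one):
# bracket atoms, two-letter halogens, and two-digit ring closures (%NN)
# count as one token each; every other character is its own token.
TOKEN_PATTERN = re.compile(r"\[[^\]]+\]|Br|Cl|%\d{2}|.")

MAX_TOKENS = 500  # threshold taken from the dataset description


def tokenize(smiles: str) -> list[str]:
    """Split a SMILES string into tokens using the illustrative pattern."""
    return TOKEN_PATTERN.findall(smiles)


def filter_by_length(smiles_list: list[str], max_tokens: int = MAX_TOKENS) -> list[str]:
    """Keep only molecules whose token count is at most max_tokens."""
    return [s for s in smiles_list if len(tokenize(s)) <= max_tokens]


# Toy input: two short molecules and one artificially long carbon chain.
molecules = ["CCO", "c1ccccc1Cl", "C" * 600]
kept = filter_by_length(molecules)
print(kept)
```

With this toy input, the 600-character chain exceeds the threshold and is dropped, while the two short molecules are kept.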

PubChem

The PubChem database is a comprehensive repository of chemical compounds, with each compound having a unique identifier.

BindingDB

The dataset is used for compound-protein binding affinity prediction, and it contains 218,615 compound-protein pairs.

ZINC250K

The ZINC250K dataset is a large collection of molecules used for molecular design and generation. It contains about 250,000 molecules with their corresponding properties and structures.

DrugBank

The dataset used in the paper consists of three matrices: a sparse user-item interaction matrix, a protein-protein similarity matrix, and a drug-drug similarity matrix.
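
A sparse interaction matrix like the one mentioned above is commonly stored by recording only its nonzero entries. The sketch below uses a plain dictionary keyed by (row, column). The class name `SparseInteractions`, the toy shape and indices, and the binary interaction values are all hypothetical and are not taken from the paper.

```python
class SparseInteractions:
    """Dictionary-of-keys sparse matrix; entries not stored are treated as 0."""

    def __init__(self, n_rows: int, n_cols: int) -> None:
        self.shape = (n_rows, n_cols)
        self._data: dict[tuple[int, int], float] = {}

    def set(self, row: int, col: int, value: float = 1.0) -> None:
        """Record an interaction; storing 0 removes the entry instead."""
        n_rows, n_cols = self.shape
        if not (0 <= row < n_rows and 0 <= col < n_cols):
            raise IndexError(f"({row}, {col}) is outside shape {self.shape}")
        if value == 0:
            self._data.pop((row, col), None)
        else:
            self._data[(row, col)] = value

    def get(self, row: int, col: int) -> float:
        """Return the stored value, or 0.0 if no interaction was recorded."""
        return self._data.get((row, col), 0.0)

    def density(self) -> float:
        """Fraction of cells that hold a nonzero value."""
        n_rows, n_cols = self.shape
        return len(self._data) / (n_rows * n_cols)


# Toy example (made-up indices): 4 rows, 5 columns, 3 known interactions.
matrix = SparseInteractions(4, 5)
for row, col in [(0, 2), (1, 4), (3, 0)]:
    matrix.set(row, col)

print(matrix.get(0, 2), matrix.get(3, 3), matrix.density())
```

Only three of the twenty cells are stored here, which is the point of a sparse layout: memory grows with the number of known interactions rather than with the full matrix size.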

ChEMBL database

The dataset used in this paper is drawn from the ChEMBL database and contains drugs/molecules and their binding information for the proteins Lyn, Lck, and Src.

ChEMBL

The ChEMBL dataset is a large collection of bioactive molecules, with over 10 million molecules, that can be used for various machine learning tasks, including molecular design.

QM9

A dataset of small molecules for benchmarking molecule generation methods. The dataset consists of fingerprints of the molecules, and the goal is to predict the original...

9 datasets found