9 datasets found

Formats: JSON

Filter Results
  • Mcule database

    The Mcule database is a collection of purchasable compounds within known stock amounts. The network has 6,369,219 nodes and 8,808,841 edges.
  • ChEMBL Dataset

    The dataset consists of all small-molecules from the ChEMBL database that were of length 500 tokens or less.
  • PubChem

    The PubChem database is a comprehensive repository of chemical compounds, with each compound having a unique identifier.
  • BindingDB

    The dataset is used for compound-protein binding affinity prediction, and it contains 218,615 compound-protein pairs.
  • ZINC250K

    The ZINC250K dataset is a large dataset of molecules used for molecular design and generation. It contains 250,000 molecules with their corresponding properties and structures.
  • DrugBank

    The dataset used in the paper is a sparse user-item interaction matrix, a protein-protein similarity matrix, and a drug-drug similarity matrix.
  • ChEMBL database

    The dataset used in this paper is the ChEMBL database, which contains drugs/molecules and their binding information for proteins Lyn, Lck, and Src.
  • ChEMBL

    The ChEMBL dataset is a large collection of bioactive molecules, with over 10 million molecules, that can be used for various machine learning tasks, including molecular design.
  • QM9

    A dataset of small molecules for benchmarking molecule generation methods. The dataset consists of fingerprints of the molecules, and the goal is to predict the original...