-
Diabetes and Asia datasets
The Diabetes and Asia datasets were used for the experiments. -
HIV-1 protease cleavage dataset
The HIV-1 protease cleavage dataset is compiled from four data source files, with the primary purpose of developing effective protease cleavage inhibitors by predicting whether the... -
Chemical-Disease Relations (CDR) dataset
The Chemical-Disease Relations (CDR) dataset was built for the BioCreative V challenge and manually annotated with a single relation type, "chemical-induced disease". -
UniProtKB Human Gene binding prediction
UniProtKB Human Gene binding prediction -
IEDB weekly automated benchmark datasets
IEDB weekly automated benchmark datasets -
LGG Multi-omic Data
The dataset is a collection of multi-omic data from lower-grade glioma (LGG) tumor samples collected by the TCGA Research Network. -
Machine Learning and Bioinformatics for Diagnosis Analysis of Obesity Spectru...
A dataset used for the diagnostic analysis of obesity spectrum disorders. -
Alternative Splicing
Alternative Splicing is a dataset of RNA sequences used for predicting alternative gene splicing. -
SCOPe dataset
Structural Classification of Proteins — extended (SCOPe) dataset -
Mutag dataset
Mutag is a benchmark dataset for graph neural networks, containing 188 nitroaromatic compounds labeled as mutagenic (125) or non-mutagenic (63). -
Enzyme Structure Dataset
This dataset contains enzyme structures and their corresponding features. -
Protein Structure Dataset
This dataset contains protein structures and their corresponding features. -
Promoter Design
We use the promoter DNA sequence dataset containing 100k promoter sequences with the corresponding transcription initiation signal profiles. -
Binding MOAD
The dataset is used for compound-protein binding affinity prediction, and it contains 1963 compound-protein pairs. -
Benchmark datasets
The dataset used in the paper is a collection of small images, each representing a patch of a jigsaw puzzle. The patches are of the same size and orientation, and the goal is to...