-
Coffee Canephora
The dataset contains a large number of features over a genome for a relatively small number of individuals. -
Wheat (Zuchtwert study)
The dataset contains a large number of features over a genome for a relatively small number of individuals. -
Arabica Coffee
The dataset contains a large number of features over a genome for a relatively small number of individuals. -
Colorado Beetle
The dataset contains a large number of features over a genome for a relatively small number of individuals. -
Eucalyptus
The dataset contains a large number of features over a genome for a relatively small number of individuals. -
TCGA-CRC-DX dataset
The TCGA-CRC-DX dataset comprises whole slide images from 360 CRC-diagnosed patients, which includes DNA mutations, RNA expressions, and clinical annotations, alongside the... -
Microbial DNA dataset
The dataset contains 100,000 microbial DNA samples. -
2021-2022 Patient Dataset
A dataset of patient specimens with karyograms, used to evaluate the performance of the deep learning model on aberration detection. -
Karyotype AI for Precision Oncology
A large-scale dataset of karyograms for hematological malignancies, used to train and evaluate a deep learning model for chromosome aberration detection. -
The 1000 Genomes Project
Human genetic variation data set containing information about genetic variations in humans. -
L008 dataset
The L008 dataset is a CROP-seq platform designed to showcase the power of our model in conjunction with modern genomics. -
Marson dataset
The Marson dataset contains perturbations of 73 unique genes where the intervention served to increase the expression of those genes. -
Sciplex dataset
The Sciplex dataset consists of three cancer cell lines (A549, MCF7, K562) treated with 188 compounds. -
New Zealand dairy cow genotyping data
A dataset of genotyping data from the New Zealand dairy cow population. -
Sequence Read Archive (NCBI), Beijing Institute of Genomics (BIG), Chinese Ac...
Raw RNA-seq data from SARS-CoV-2, SARS-CoV and MERS-CoV infected cell lines and COVID-19 patient-derived samples -
A map of human genome variation from population-scale sequencing
A map of human genome variation from population-scale sequencing -
A haplotype map of the human genome
A haplotype map of the human genome -
GTEx consortium dataset
The GTEx consortium dataset contains gene expression measurements for skeletal muscle tissue. -
Longitudinal Yeast Data
The longitudinal yeast data set, containing three strains of haploid S288c, grown for 448 generations under limited-glucose conditions.