-
UCI+ repository
Characterizes the complexity of classification problems using various measures. -
IRI Academic Data Set
The IRI Academic Data Set is a collection of real-world transaction records of store sales and consumer panels for thirty product categories, and includes sales information for... -
scX: A user-friendly tool for scRNA-seq exploration
Single-cell RNA sequencing (scRNA-seq) has transformed our ability to explore biological systems. Nevertheless, proficient expertise is essential for handling and interpreting... -
Market Basket Analysis Dataset
The dataset contains the purchases of anonymous households in chain grocery and drug stores. -
Bike Sharing Dataset
The Bike Sharing Dataset is an hourly time series of bike rentals in the Capital Bikeshare system for the years 2011 and 2012. -
Bigeometric Organization of Deep Nets
The dataset is used to demonstrate the bigeometric organization of deep nets. It contains hospital quality data from the Centers for Medicare & Medicaid Services Hospital... -
Nonparametric Data Analysis on the Space of Perceived Colors
The dataset is used for nonparametric data analysis on the space of perceived colors. -
Lakeland Game Telemetry Data
The dataset used in this paper is a collection of telemetry data from Lakeland, an open-ended, achievement-driven game. -
Lederman-Talmon Dataset
The dataset used in the paper, from Lederman and Talmon, contains two figurines rotating at different speeds. -
Kuramoto-Sivashinsky Equation
The Kuramoto-Sivashinsky equation is a model for spatiotemporal chaos. It is expressed as: $y_t = -y\,y_x - y_{xx} - y_{xxxx}$ -
Genus Two Surface
The dataset used in the paper is a synthetic, densely sampled surface of genus two. -
Synthetic Neuroscience Example
The dataset used in the paper is a synthetic dataset inspired by the types of neuronal responses to stimuli. It contains three populations of neurons tuned to elevation,... -
Multiple Penalized Principal Curves: Analysis and Computation
The dataset is used to test the proposed approach for finding one-dimensional structures in data. -
SAARC Panel Data
The dataset used in this paper is panel data on the 7 member countries of SAARC. -
Large-scale Machine Learning Dataset
The dataset used in this paper is a large-scale machine learning dataset. -
EnCoD: Distinguishing Compressed and Encrypted File Fragments
A large, standardized dataset of encrypted and compressed fragments covering various popular file formats and fragment sizes. -
Breast Cancer Dataset
Breast cancer dataset where mammograms have been labeled independently by three doctors. Ground-truth has been obtained through a biopsy, not available to the algorithm nor the... -
Dry Bean Dataset
The Dry Bean Dataset is a collection of data related to dry bean yield and quality. -
5 Tabular Datasets
The dataset used in the paper is a collection of 5 tabular datasets available in the scikit-learn toolkit: boston (D=13, n=506), iris (D=4, n=150), diabetes (D=10, n=442), wine...