-
Big Traffic Database
The dataset is a big traffic database constructed by simulating traffic data to cover a variety of traffic situations. -
Netflix Dataset
The dataset used in the paper is a Netflix dataset, which is a large-scale matrix factorization problem. -
Matrix Completion using SGD
The dataset used in the paper is a matrix completion problem, where the goal is to predict missing entries in a matrix based on known entries. -
Smart card dataset of bus ridership
Three-month smart card dataset of bus ridership, containing over 10 million observations allied with detailed weather measurements, trip length, calendar events, and built... -
Sharp Frequency Bounds for Sample-Based Queries
The dataset used in this paper is a big data set, and the authors use a data sketch algorithm to statistically infer probably approximately correct (PAC) bounds for frequencies... -
Journal of Big Data 2016
The dataset used in the paper is a collection of computer science journal articles, specifically the Journal of Big Data 2016.