-
Netflix Dataset
The dataset used in the paper is a Netflix dataset, which is a large-scale matrix factorization problem. -
Matrix Completion using SGD
The dataset used in the paper is a matrix completion problem, where the goal is to predict missing entries in a matrix based on known entries. -
Smart card dataset of bus ridership
Three-month smart card dataset of bus ridership, containing over 10 million observations allied with detailed weather measurements, trip length, calendar events, and built... -
Sharp Frequency Bounds for Sample-Based Queries
The dataset used in this paper is a big data set, and the authors use a data sketch algorithm to statistically infer probably approximately correct (PAC) bounds for frequencies... -
Journal of Big Data 2016
The dataset used in the paper is a collection of computer science journal articles, specifically the Journal of Big Data 2016. -
RADAR (Research Data Repository)
Imported
Data set for the population survey “attitudes towards big data practices and ...
Abstract: The aim of this study is to gain insights into the attitudes of the population towards big data practices and the factors influencing them. To this end, a nationwide...