-
Training Data and Runtime Monitoring for Safety Critical ML Applications
The dataset used in the study on challenges encountered when specifying training data and runtime monitors for safety critical ML applications. -
Quantum Neural Networks
The dataset used in this paper is a collection of quantum neural network models, including VQA, CV, swap test and phase estimation, RUS, quantum generalization, QBM, QCVNN,... -
Best-scored Random Forest
The dataset used in this paper is a binary classification problem. -
Adversarial Data Programming: Using GANs to Relax the Bottleneck of Curated L...
Paucity of large curated hand-labeled training data for every domain-of-interest forms a major bottleneck in the deployment of machine learning models in computer vision and... -
Ensemble Transform Kalman Filter (ETKF) for Data Assimilation
The dataset used in this paper is a set of synthetic data for the 3-variable Lorenz system and for the Kuramoto-Sivashinsky system, simulating model error in each case by a... -
Gradient Adversarial Training
The dataset used for gradient adversarial training of neural networks. -
HELOC Dataset
The HELOC dataset is a multivariate dataset containing information about home credit applications. It includes variables such as external risk estimate, MSinceOldestTradeOpen,... -
FOLD-TR: A Scalable and Efficient Inductive Learning Algorithm for Learning t...
FOLD-TR is a customized FOLD-R++ algorithm with ranking framework, that aims to rank new items following the ranking pattern in the training data. -
Iterative Retraining Dataset
The dataset used for the iterative retraining experiments, which includes 20% augmented training and validation sets. -
TaxiNet Dataset
The dataset used for the TaxiNet case study, which includes 6-dimensional semantic feature space defined by SCENIC programs and searched by VERIFAI. -
Fashion-MNIST, CIFAR-10, and GTSRB datasets
The Fashion-MNIST, CIFAR-10, and GTSRB datasets were used to evaluate differentiable logics for learning systems. -
BRITE Light Curves
The BRITE light curves were used as the training samples of non-transit light curves. The transit signals were injected into the BRITE light curves to produce synthetic transit... -
20 Newsgroups Text Classification Dataset
The dataset used in this paper is a collection of 20 Newsgroups text classification problems. -
Wine Quality Classification Dataset
The dataset used in this paper is a collection of wine quality classification problems. -
Ionosphere Classification Dataset
The dataset used in this paper is a collection of ionosphere classification problems. -
Mushroom Classification Dataset
The dataset used in the paper is a mushroom classification problem with 8124 instances. -
Logistic Regression Dataset
The dataset used in this paper is a collection of logistic regression problems. -
What does Big mean in the realm of materials science data?
Big data has ushered in a new wave of predictive power using machine learning models. In this work, we assess what big means in the context of typical materials-science... -
NeurIPS 2019 Retrospectives Workshop
The dataset used in the NeurIPS 2019 Retrospectives workshop to discuss ideas for improving the field of machine learning. -
MLPerf Benchmark Dataset
The dataset used in this paper is the MLPerf benchmark dataset.