Dataset - LDM

The Surprising Harmfulness of Benign Overﬁtting for Adversarial Robustness

The dataset is used to study the relationship between benign overﬁtting and adversarial robustness in machine learning models.
- Dataset
- JSON
A general theoretical paradigm to understand learning from human preferences

The paper proposes a novel approach to aligning language models with human preferences, focusing on the use of preference optimization in reward-free RLHF.
- Dataset
- JSON
Machine Learning and Bioinformatics for Diagnosis Analysis of Obesity Spectru...

The dataset used for diagnosis analysis of obesity spectrum disorders
- Dataset
- JSON
ArduCode: Predictive Framework for Automation Engineering

Two real datasets consisting of 2,927 Arduino projects and 683 Programmable Logic Controller (PLC) projects.
- Dataset
- JSON
Accelerating Deep Learning with Shrinkage and Recall

Deep Learning is a very powerful machine learning model. Deep Learning trains a large number of parameters for multiple layers and is very slow when data is in large scale and...
- Dataset
- JSON
Llama: Open and efficient foundation language models

The LLaMA dataset is a large language model dataset used in the paper.
- Dataset
- JSON
Colored MNIST dataset

The dataset used in the paper is a binary classification task in a 300-dimensional space. The procedure for generating the training dataset is as follows: Each label y ∈ {−1, 1}...
- Dataset
- JSON
Automatic Chemical Design Using a Data-Driven Continuous Representation of Mo...

A dataset for automatic chemical design using a data-driven continuous representation of molecules.
- Dataset
- JSON
LIME and SHAP explanations for issue type predictions

The dataset contains 3092 issues with the prediction whether they are a bug or not from the machine learning models and their corresponding LIME and SHAP explanations.
- Dataset
- JSON
Fusarium head blight detection in wheat under field conditions

A dataset used for detecting Fusarium head blight in wheat under field conditions using a hyperspectral camera and machine learning.
- Dataset
- JSON
Agreement ADOS database, Kaggle database, and self-gathered video test dataset

The AGRE ADOS database, Kaggle database, and a self-gathered video test dataset with corresponding ADOS data
- Dataset
- JSON
Malware Classification Dataset

The dataset used in this paper is a malware dataset containing 10,896 malware files belonging to 9 different malware families.
- Dataset
- JSON
Implicit Multigrid-Augmented DL for the Helmholtz Equation

The dataset used in this paper is a collection of slowness models for the Helmholtz equation, generated from the CIFAR-10, OpenFWI Style-A, and STL-10 datasets.
- Dataset
- JSON
3-d Gaussian Data

The dataset used in the paper is a 3-d Gaussian distributed dataset.
- Dataset
- JSON
Component Decoupled Data

The dataset used in the paper is a synthetic dataset generated by the component decoupled model described in Section 3.
- Dataset
- JSON
Malware dataset

The dataset consists of 20 malware families. Three of these malware families, namely, Winwebsec, Zeroaccess, and Zbot, are from the Malicia dataset, while the remaining 17...
- Dataset
- JSON
Gradient-based learning applied to document recognition

Gradient-based learning applied to document recognition.
- Dataset
- JSON
BEND: Bagging Deep Learning Training Based on Efficient Neural Network Diffusion

The paper proposes a Bagging Deep Learning Training Framework (BEND) based on efficient neural network diffusion.
- Dataset
- JSON
Koopcon: A new approach towards smarter and less complex learning

The dataset condensation problem involves transforming a large-scale training set X into a smaller synthetic set X'.
- Dataset
- JSON
Decoy-MNIST

The dataset used in the paper is a synthetic dataset similar to decoy-MNIST of Ross et al. (2017) with induced shortcuts and is presented in Section 5.2. For evaluation on...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

581 datasets found