-
ParaBank 2
ParaBank 2 is a large synthetic paraphrase dataset created by translating one side of bitext into the language of the other side. -
LLM Synthetic Dataset
The dataset is used to evaluate the performance of the 𝛼-RNN model on various time series tasks. -
Synthetic Test Functions
The dataset used in the paper is a set of synthetic test functions, which are used to evaluate the performance of the Deterministic Langevin Optimization algorithm. -
Synthetic data 2
Synthetic data 2 -
Synthetic data 1
Synthetic data 1 -
The Simulacrum
Simulacrum dataset, a synthetic dataset for medical research, generated using a directed graph and conditional distributions -
Synthetic Datasets for Instance-Level Fidelity Evaluation
Four synthetic datasets aligned with the real-world AV dataset KITTI. -
Virtual KITTI 2
The Virtual KITTI 2 dataset is a synthetic clone of the real KITTI dataset, containing 5 sequence clones of Scene 01, 02, 06, 18 and 20, and nine variants with diverse weather... -
Synthetic Data Generation for Variational Autoencoders
Synthetic swaption cubes generated from existing ones -
Synthetic Data Set
The dataset generated by the method of moments (MoM) for training supervised NeuralBIM. -
Synthetic Laser Reliability Data
A dataset for training machine learning models for predictive maintenance of semiconductor lasers -
Simulated dataset
The dataset used in this paper is a simulated dataset with 200 variables and 50 observations. The variables are generated from a multivariate normal distribution with a... -
Synthetic Data
The dataset used in the paper is a synthetic dataset for off-policy contextual bandits, with contexts x ∈ X, a finite set of actions A, and bounded real rewards r ∈ A → [0, 1]. -
Synthetic Dataset
The dataset used in this work is a custom synthetic dataset generated using the liquid-dsp library, containing 600000 examples of each of 13.8 million examples, with SNRs... -
Resyris - a real-synthetic rock instance segmentation dataset for training an...
A real-synthetic rock instance segmentation dataset for training and benchmarking.