-
Agglomerative Clustering of Simulation Output Distributions
The dataset is used for clustering simulation output distributions using the regularized Wasserstein distance. -
Two-Sample Testing Using Projected Wasserstein Distance
Two-sample test using projected Wasserstein distance for the two-sample test, a fundamental problem in statistics and machine learning: given two sets of samples, to determine... -
Optimal Transport Modeling
The dataset used in the paper is a noise distribution and a high-dimensional data distribution. -
Wasserstein-2 Benchmark
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a high-dimensional continuous distribution p0, p1 for which the ground truth... -
Koopcon: A new approach towards smarter and less complex learning
The dataset condensation problem involves transforming a large-scale training set X into a smaller synthetic set X'. -
Synthetic MNIST dataset
The dataset used in the paper is a synthetic MNIST dataset generated by forming barycenters constructed with weights sampled uniformly from ∆3. -
SimpleQuestion dataset for Wikidata
The dataset used in this paper is a reinforcement learning dataset, specifically the SimpleQuestion dataset, which contains questions answerable using Wikidata as the knowledge...