You're currently viewing an old version of this dataset. To see the current version, click here.

Synthetic Data

The dataset used in the paper is a synthetic dataset for off-policy contextual bandits, with contexts x ∈ X, a finite set of actions A, and bounded real rewards r ∈ A → [0, 1].

Data and Resources

Cite this as

Gemma E. Moran, Dhanya Sridhar, Yixin Wang, David M. Blei (2024). Dataset: Synthetic Data. https://doi.org/10.57702/ud58iifq

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2012.04580
Citation
  • https://doi.org/10.48550/arXiv.2401.02665
  • https://doi.org/10.48550/arXiv.2008.07720
  • https://doi.org/10.48550/arXiv.1906.03323
  • https://doi.org/10.48550/arXiv.2405.05430
  • https://doi.org/10.48550/arXiv.2110.10804
  • https://doi.org/10.48550/arXiv.2002.01599
  • https://doi.org/10.48550/arXiv.2206.05490
  • https://doi.org/10.48550/arXiv.1904.03483
  • https://doi.org/10.1016/j.neunet.2019.04.020
  • https://doi.org/10.48550/arXiv.2308.08321
Author Gemma E. Moran
More Authors
Dhanya Sridhar
Yixin Wang
David M. Blei
Homepage https://arxiv.org/abs/2202.03465