Toy Example Dataset

The dataset used in the paper is a toy example, consisting of a 10x10 grid world, with the agent at position (0, 0). Obstacles are randomly positioned, at an obstacle to free position ratio of 0.2. The agent is presented a plan π (an action sequence) of 10 movements (up, down, left, right, with obvious semantics). The agent has a Bernoulli action failure probability pfail uniformly sampled from [0; 1]. Action failure results in the inverse movement (e.g. failing up yields down). The agent is presented a number of observations about its failure probability before running BV.

Data and Resources

Cite this as

Xin Li (2024). Dataset: Toy Example Dataset. https://doi.org/10.57702/szo7pq1x

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.1702.08725
Citation
  • https://doi.org/10.48550/arXiv.2404.01183
Author Xin Li
Homepage https://github.com/jazzbob/bv