Exploration Metrics for Reinforcement Learning

The dataset used in the paper is a set of data generated from four different types of distributions: uniform, truncated normal, bi-modal truncated normal growing scale, and bi-modal truncated normal moving locations.

BibTex: