The BRIDGE dataset is a collection of 155 deterministic Markov decision processes (MDPs), each with a horizon of 100 time steps. It is used to evaluate reinforcement learning algorithms.
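To make the terms concrete, here is a minimal sketch of what a deterministic, finite-horizon MDP looks like and how a policy can be scored on it. It is an illustration only and does not reflect the dataset's actual file format or loading code. The names `DeterministicMDP`, `rollout_return`, and `toy` are hypothetical, and the two-state example is made up for the purpose.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

State = int
Action = int


@dataclass
class DeterministicMDP:
    """Hypothetical container, not the BRIDGE storage format."""
    # Each (state, action) pair maps to exactly one (next_state, reward),
    # which is what makes the MDP deterministic.
    transitions: Dict[Tuple[State, Action], Tuple[State, float]]
    initial_state: State
    horizon: int  # number of time steps per episode


def rollout_return(mdp: DeterministicMDP,
                   policy: Callable[[State, int], Action]) -> float:
    """Sum of rewards collected by `policy` over one episode.

    Because transitions are deterministic, a single rollout gives the
    exact return of a deterministic policy; no averaging is needed.
    """
    state = mdp.initial_state
    total = 0.0
    for t in range(mdp.horizon):
        action = policy(state, t)
        state, reward = mdp.transitions[(state, action)]
        total += reward
    return total


# Made-up toy problem: action 1 always yields reward 1, action 0 yields 0.
toy = DeterministicMDP(
    transitions={
        (0, 0): (0, 0.0),
        (0, 1): (1, 1.0),
        (1, 0): (0, 0.0),
        (1, 1): (1, 1.0),
    },
    initial_state=0,
    horizon=100,  # matches the 100-step horizon described above
)

print(rollout_return(toy, lambda s, t: 1))
print(rollout_return(toy, lambda s, t: 0))
```

A reinforcement learning algorithm would be evaluated by comparing the return of the policy it learns against the best achievable return on each MDP.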
BibTeX: