You're currently viewing an old version of this dataset. To see the current version, click here.

RND dataset

The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the RND exploration method.

Data and Resources

Cite this as

Alexis Jacq, Manu Orsini, Gabriel Dulac-Arnold, Olivier Pietquin, Matthieu Geist, Olivier Bachem (2024). Dataset: RND dataset. https://doi.org/10.57702/nxav3pkf

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.48550/arXiv.2211.03521
Author Alexis Jacq
More Authors
Manu Orsini
Gabriel Dulac-Arnold
Olivier Pietquin
Matthieu Geist
Olivier Bachem