-
Explore2Offline
The dataset used in the paper for offline reinforcement learning, consisting of task-agnostic exploration data collected via curiosity-based intrinsic motivation. -
Binary Tree MDP
The dataset used in the paper is a binary tree MDP, where the agent must execute a sequence of L uninterrupted UP movements. The dataset is used to test the Successor...