-
MuJoCo AntGoal
The dataset used in the paper is the MuJoCo AntGoal environment, which is a variant of the AntGoal environment that uses sparse rewards. -
D4RL Benchmark
D4RL benchmark dataset, which consists of four offline logging datasets, collected by different one or mixed behavior policies.