-
HalfCheetahDir
The dataset used in the paper is the HalfCheetahDir environment, which is a variant of the HalfCheetah environment that uses sparse rewards. -
MuJoCo AntGoal
The dataset used in the paper is the MuJoCo AntGoal environment, which is a variant of the AntGoal environment that uses sparse rewards. -
ELIGN: Expectation Alignment as a Multi-Agent Intrinsic Reward
Modern multi-agent reinforcement learning frameworks rely on centralized training and reward shaping to perform well. However, centralized training and dense rewards are not... -
OpenAI Gym
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used several continuous control environments from the OpenAI Gym.