Self-supervised Relational RL with Independently Controllable Subgoals
The dataset used in the paper is a multi-object environment with a robotic arm and multiple objects to manipulate. The agent learns to control the objects independently and solve a compositional goal.
BibTex: