Automatic Curricula via Expert Demonstrations (ACED)

ACED constructs a curriculum by initializing each training episode at a state sampled from expert demonstration trajectories. Early in training, these states are drawn from near the end of the demonstrations; as the agent's performance improves, the sampling point moves gradually toward the beginning.
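
The reset schedule described above can be illustrated with a minimal sketch. This is not the ACED implementation: the class name `ReverseCurriculum`, the split of each demonstration into equal sections, and the success-rate rule for advancing the curriculum (`success_threshold`, `window`) are all assumptions made for illustration. Demonstrations are assumed to be plain lists of states.

```python
import random


class ReverseCurriculum:
    """Hypothetical sketch of a demonstration-based reset curriculum."""

    def __init__(self, demos, num_sections=5, success_threshold=0.8,
                 window=20, seed=0):
        self.demos = demos                          # list of state lists
        self.num_sections = num_sections            # assumed: equal sections
        self.success_threshold = success_threshold  # assumed advancement rule
        self.window = window                        # episodes per evaluation
        self.section = num_sections - 1             # start at the final section
        self.results = []
        self.rng = random.Random(seed)

    def sample_start_state(self):
        """Return an initial state drawn from the current demo section."""
        demo = self.rng.choice(self.demos)
        n = len(demo)
        lo = self.section * n // self.num_sections
        hi = max((self.section + 1) * n // self.num_sections, lo + 1)
        return demo[self.rng.randrange(lo, hi)]

    def record(self, success):
        """Log an episode outcome; move one section earlier once the
        success rate over the last `window` episodes is high enough."""
        self.results.append(bool(success))
        recent = self.results[-self.window:]
        if (len(recent) == self.window
                and sum(recent) / self.window >= self.success_threshold
                and self.section > 0):
            self.section -= 1
            self.results.clear()
```

In a training loop, one would call `sample_start_state()` to reset the environment at the start of each episode and `record(success)` at the end. How the environment is restored to a given state depends on the simulator and is left out here.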

BibTeX: