Expert Demonstrations
The expert demonstrations are generated according to the given optimal policy for the recovery. The length of each expert demonstration is 5-grid size trajectory length. Four algorithms for the evaluation includes MaxEnt, DeepMaxEnt, SIRL, and DSIRL.
BibTex: