Dual Policy Distillation

The dataset used in the paper is a continuous control task dataset.

BibTex: