Duckietown environment

The dataset used in this paper is a collection of state-action pairs generated by a pre-trained RL agent, used to train a self-supervised interpretable network (SSINet) to produce attention masks for explaining the agent's decisions.

BibTex: