-
OPIRL: Sample Efficient Off-Policy Inverse Reinforcement Learning via Distrib...
The OPIRL dataset is used for training and testing the Off-Policy Inverse Reinforcement Learning (OPIRL) algorithm. -
Soft Actor-Critic
A soft actor-critic algorithm for off-policy maximum entropy deep reinforcement learning.