-
Model-based Offline Policy Optimization with Adversarial Network (MOAN)
Offline RL framework called MOAN, which introduces a two-player game to improve the generalization capability of the transition model and mitigate the negative effects of...
-
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforceme...
This paper presents advanced techniques of training diffusion policies for offline reinforcement learning (RL).