-
Entropy-regularized Diffusion Policy with Q-Ensembles for Offline Reinforceme...
This paper presents advanced techniques of training diffusion policies for offline reinforcement learning (RL). -
Conservative data sharing for multi-task offline reinforcement learning
Conservative data sharing for multi-task offline reinforcement learning. -
Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning
Offline reinforcement learning (RL) paradigm provides a general recipe to convert static behavior datasets into policies that can perform better than the policy that collected... -
D4RL Benchmark
D4RL benchmark dataset, which consists of four offline logging datasets, collected by different one or mixed behavior policies.