-
Chain of Interaction Skills
The dataset used in the paper is a robotic pushing domain with negative reward regions, and variants of the video game Breakout. -
Room Clearance with Feudal Hierarchical Reinforcement Learning
A new simulation environment designed as a simple testbed for demonstrating the utility of RL as a tool for concept analysis with military applications as well as to aid with...