2 datasets found

Tags: Continuous Control

  • Acrobot

    The dataset used in the paper is a reinforcement learning dataset, modeled as a Markov Decision Process (MDP) with a finite set of states and actions.
  • Mountain Car

    The dataset used in the paper is a reinforcement learning dataset, modeled as a Markov Decision Process (MDP) with a finite set of states and actions.