328 datasets found

Groups: Reinforcement Learning

Filter Results
  • Atari 57

    The Atari 57 benchmark is a collection of 57 Atari games, each with its own set of states and actions.
  • Four Rooms

    The Four Rooms environment is a stochastic version of the classic Atari game Four Rooms. The environment has 104 states and 4 actions, and the agent can move in any of the 4...
  • Soft Actor-Critic With Integer Actions

    Reinforcement learning under integer actions by incorporating the Soft Actor-Critic (SAC) algorithm with an integer reparameterization.
  • Habitat

    The Habitat dataset is a large-scale indoor simulator dataset containing 145 semantically-annotated indoor scenes.
  • PinBall

    The PinBall domain is a continuous state domain where the agent must navigate a ball through a set of obstacles to reach the main goal, with a four-dimensional state space...
  • GridBall

    The GridBall domain is similar to the FourRooms domain, but change to be more like a grid-world to facilitate visualization. The velocity components of the state are removed,...
  • FourRooms

    The FourRooms domain is a continuous state domain where the agent navigates a ball through a set of obstacles to reach the main goal. The environment uses a four-dimensional...
  • Kuka Object Manipulation Datasets

    The dataset is used for training and testing the Equivariant Diffuser for Generating Interactions (EDGI) algorithm.
  • Navigation and Manipulation Datasets

    The dataset is used for training and testing the Equivariant Diffuser for Generating Interactions (EDGI) algorithm.
  • Giraffe

    The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.
  • Nerf++

    The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.
  • NeRF-RL

    The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.
  • Robomimic Environment

    Robomimic environment consists of tasks such as lift, can, square, tool-hang, and transport.
  • D4RL Benchmark Suite

    D4RL benchmark suite consists of tasks such as locomotion, antmaze, adroit, and kitchen.
  • Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning

    Offline reinforcement learning (RL) paradigm provides a general recipe to convert static behavior datasets into policies that can perform better than the policy that collected...
  • Random Walk dataset

    The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the random walk exploration method.
  • RND dataset

    The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the RND exploration method.
  • SMM dataset

    The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the SMM exploration method.
  • ChronoGEM dataset

    The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the ChronoGEM exploration method.
  • Gridworld Dataset

    The dataset used for the Gridworld tasks, consisting of 10K episodes of the agent following a uniform random policy.