Reinforcement Learning - Groups

Atari 57

The Atari 57 benchmark is a suite of 57 Atari 2600 games from the Arcade Learning Environment, each with its own state space and action set.

Four Rooms

The Four Rooms environment is a stochastic version of the classic Four Rooms gridworld. The environment has 104 states and 4 actions, and the agent can move in any of the 4...
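
The 104-state count matches the free cells of the classic 13x13 Four Rooms layout. The sketch below is only an illustration under stated assumptions, not this dataset's implementation: the four actions are assumed to be up, down, left, and right, and `slip_prob` is a hypothetical parameter standing in for the unspecified stochasticity.

```python
import random

# Classic 13x13 Four Rooms layout ('w' = wall, ' ' = free cell).
LAYOUT = [
    "wwwwwwwwwwwww",
    "w     w     w",
    "w     w     w",
    "w           w",
    "w     w     w",
    "w     w     w",
    "ww wwww     w",
    "w     www www",
    "w     w     w",
    "w     w     w",
    "w           w",
    "w     w     w",
    "wwwwwwwwwwwww",
]

# Assumed action set: up, down, left, right as (row, col) offsets.
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]

# Every free cell is one state.
FREE_CELLS = [
    (r, c)
    for r, row in enumerate(LAYOUT)
    for c, ch in enumerate(row)
    if ch == " "
]


def step(state, action, slip_prob=0.0, rng=random):
    """Move one cell; with probability slip_prob (hypothetical) a
    uniformly random action replaces the chosen one."""
    if rng.random() < slip_prob:
        action = rng.randrange(len(ACTIONS))
    dr, dc = ACTIONS[action]
    r, c = state[0] + dr, state[1] + dc
    # Bumping into a wall leaves the agent where it was.
    return (r, c) if LAYOUT[r][c] == " " else state


print(len(FREE_CELLS), "states,", len(ACTIONS), "actions")
```

Setting `slip_prob` above zero makes transitions stochastic; the value used by the actual environment is not given in the description.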

Soft Actor-Critic With Integer Actions

This work addresses reinforcement learning with integer actions by combining the Soft Actor-Critic (SAC) algorithm with an integer reparameterization.

Habitat

The Habitat dataset is a large-scale indoor simulator dataset containing 145 semantically-annotated indoor scenes.

PinBall

The PinBall domain is a continuous state domain where the agent must navigate a ball through a set of obstacles to reach the main goal, with a four-dimensional state space...

GridBall

The GridBall domain is similar to the FourRooms domain, but is changed to be more like a grid world to facilitate visualization. The velocity components of the state are removed,...

FourRooms

The FourRooms domain is a continuous state domain where the agent navigates a ball through a set of obstacles to reach the main goal. The environment uses a four-dimensional...

Kuka Object Manipulation Datasets

The dataset is used for training and testing the Equivariant Diffuser for Generating Interactions (EDGI) algorithm.

Navigation and Manipulation Datasets

The dataset is used for training and testing the Equivariant Diffuser for Generating Interactions (EDGI) algorithm.

Giraffe

The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.

NeRF++

The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.

NeRF-RL

The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.

Robomimic Environment

The Robomimic environment includes tasks such as lift, can, square, tool-hang, and transport.

D4RL Benchmark Suite

The D4RL benchmark suite includes task domains such as locomotion, AntMaze, Adroit, and Kitchen.

Action-Quantized Offline Reinforcement Learning for Robotic Skill Learning

The offline reinforcement learning (RL) paradigm provides a general recipe to convert static behavior datasets into policies that can perform better than the policy that collected...

Random Walk dataset

The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the random walk exploration method.

RND dataset

The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the RND exploration method.

SMM dataset

The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the SMM exploration method.

ChronoGEM dataset

The dataset used in the paper is a collection of states sampled from a Markov Decision Process (MDP) using the ChronoGEM exploration method.

Gridworld Dataset

The dataset used for the Gridworld tasks consists of 10K episodes of the agent following a uniform random policy.
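
As a rough illustration of how episodes under a uniform random policy might be collected, the sketch below rolls out random actions in a small grid. The grid size, horizon, start-state sampling, and the `collect_random_episodes` helper are all hypothetical; the actual Gridworld tasks and data format are not described here.

```python
import random


def collect_random_episodes(num_episodes, horizon=20, size=5, seed=0):
    """Roll out a uniform random policy on an open size x size grid.

    All defaults are illustrative assumptions, not the dataset's settings.
    Returns a list of episodes, each a list of (state, action, next_state).
    """
    rng = random.Random(seed)
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
    episodes = []
    for _ in range(num_episodes):
        # Assumed: each episode starts from a uniformly random cell.
        state = (rng.randrange(size), rng.randrange(size))
        episode = []
        for _ in range(horizon):
            # Uniform random policy: every action is equally likely.
            action = rng.randrange(len(moves))
            dr, dc = moves[action]
            # Clamp to the grid so moves off the edge leave that coordinate unchanged.
            nxt = (
                min(max(state[0] + dr, 0), size - 1),
                min(max(state[1] + dc, 0), size - 1),
            )
            episode.append((state, action, nxt))
            state = nxt
        episodes.append(episode)
    return episodes


dataset = collect_random_episodes(100)
print(len(dataset), "episodes of", len(dataset[0]), "steps each")
```

Calling `collect_random_episodes(10_000)` would produce the episode count quoted in the description, again under these assumed settings.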

328 datasets found