No Organization - Organizations

Mountain Climbing Tasks

The dataset used in the paper is a set of Mountain Climbing tasks, which are a collection of tasks that involve climbing a mountain using a robotic arm.

Dataset
JSON

Object-pusher environment

The dataset used in the paper is a simulated object-pusher environment.

Dataset
JSON

Target Stacking

A synthetic block stacking environment with physics simulation in which the agent can learn block stacking end-to-end through trial and error, bypassing to explicitly model the...

Dataset
JSON

Reinforcement learning to optimize long-term user engagement in recommender s...

A method for optimizing long-term user engagement in recommender systems.

Dataset
JSON

Rethinking reinforcement learning for recommendation: A prompt perspective

A prompt-based approach for sequential recommendation.

Dataset
JSON

DARLEI: Deep Accelerated Reinforcement Learning with Evolutionary Intelligence

DARLEI is a framework that combines evolutionary algorithms with parallelized reinforcement learning for efficiently training and evolving populations of UNIMAL agents.

Dataset
JSON

Acrobot

The dataset used in the paper is a reinforcement learning dataset, specifically a Markov Decision Process (MDP) with a finite set of states and actions.

Dataset
JSON

Grid World

The dataset used in the paper is a reinforcement learning dataset, specifically a Markov Decision Process (MDP) with a finite set of states and actions.

Dataset
JSON

Human-Human Trajectories

The dataset used in the paper is a set of human-human trajectories for training a behavioral cloning model.

Dataset
JSON

Temporally Layered Architecture (TLA) for Adaptive, Distributed and Continuou...

The dataset used in the Temporally Layered Architecture (TLA) for adaptive, distributed and continuous control.

Dataset
JSON

Self-Learning Search Engine (SLSE) dataset

The dataset used in this paper is a multimedia search engine dataset, which is a Self-Learning Search Engine (SLSE) architecture based on reinforcement learning.

Dataset
JSON

Waypoints and Edges

The dataset used in the paper is a set of waypoints and edges for planning.

Dataset
JSON

2D Environment

The dataset used in the paper is a 2D environment where experiments are done.

Dataset
JSON

Policy Gradients using Variational Quantum Circuits

Variational Quantum Circuits are being used as versatile Quantum Machine Learning models. Some empirical results exhibit an advantage in supervised and generative learning...

Dataset
JSON

Replay Buffer

The dataset used in the paper is a replay buffer containing observations from a navigation task.

Dataset
JSON

Bank Heist

The Bank Heist environment is a 2D maze with four rooms, where the objective is to navigate to banks distributed across the four mazes.

Dataset
JSON

Noisy MNIST

The MNIST environment does not elicit any actions from an agent. Instead, the prediction network simply needs to learn one step mappings between pairs of MNIST handwritten digits.

Dataset
JSON

MuJoCo Environment

The dataset used in the paper is a MuJoCo environment, with 13-states and 4-control inputs, nonlinear dynamics with polynomial dependency in the control inputs.

Dataset
JSON

Corridor Environment

The corridor environment is a simple environment where the agent has to determine whether the rewarding cell (colored yellow) is at the top or bottom, based on the color of the...

Dataset
JSON

Policy Optimization for Low-rank MDPs (POLO)

Learning Adversarial Low-rank Markov Decision Processes with Unknown Transition and Full-information Feedback

Dataset
JSON

397 datasets found