-
Time Limits in Reinforcement Learning
The dataset used in the paper is a reinforcement learning dataset, specifically for time-limited tasks and time-unlimited tasks. -
Reinforcement Learning with Convex Constraints
The dataset used in the paper is a reinforcement learning problem with arbitrary convex constraints. -
RL Boosting via Weak Supervised Learning
The dataset used in the paper is a reinforcement learning dataset, where the goal is to learn a policy that maximizes the expected return in a Markov decision process. -
Monas: Multi-objective neural architecture search using reinforcement learning
The authors propose a multi-objective neural architecture search using reinforcement learning. -
Markov Decision Process
The dataset used in the paper is a Markov Decision Process, where states can take values in a state space X, corresponding to a state x ∈ X, we can take an action u ∈ U,... -
OpenAI Gym
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used several continuous control environments from the OpenAI Gym.