Markov Decision Process - Groups

Acrobot

The paper uses a reinforcement learning dataset modeled as a Markov Decision Process (MDP) with finite sets of states and actions.

Grid World

The paper uses a reinforcement learning dataset modeled as a Markov Decision Process (MDP) with finite sets of states and actions.
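
To make the "finite set of states and actions" concrete, here is a minimal sketch of a grid world solved with value iteration. The 4x4 layout, the goal cell, the step cost of -1, and the discount factor are all illustrative assumptions and are not taken from the paper.

```python
import numpy as np

# Hypothetical 4x4 grid; size, goal, rewards, and discount are assumptions.
N = 4
GOAL = (3, 3)
ACTIONS = [(-1, 0), (1, 0), (0, -1), (0, 1)]  # up, down, left, right
GAMMA = 0.9


def step(state, action):
    """Deterministic transition: move one cell, stay in place at a wall."""
    if state == GOAL:
        return state, 0.0  # the goal is absorbing and yields no further cost
    r, c = state
    dr, dc = action
    nr = min(max(r + dr, 0), N - 1)
    nc = min(max(c + dc, 0), N - 1)
    return (nr, nc), -1.0  # every move costs 1


def grid_value_iteration(tol=1e-8):
    """Repeat the Bellman optimality backup until the values stop changing."""
    V = np.zeros((N, N))
    while True:
        V_new = np.zeros_like(V)
        for r in range(N):
            for c in range(N):
                q_values = []
                for a in ACTIONS:
                    (nr, nc), reward = step((r, c), a)
                    q_values.append(reward + GAMMA * V[nr, nc])
                V_new[r, c] = max(q_values)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new
        V = V_new


grid_values = grid_value_iteration()
print(np.round(grid_values, 3))
```

Cells closer to the goal end up with values closer to zero, so a greedy agent following increasing values walks a shortest path to the goal.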

Forest management problem

The dataset used in this paper is a forest management problem with two objectives: maintaining an old forest for wildlife and making money by selling the cut wood.
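
The trade-off between keeping the forest and cutting it can be sketched as a small two-action MDP. The formulation below is an assumption for illustration, not the paper's specification: states are forest age classes, the actions are "wait" and "cut", a fire resets the forest with probability `p_fire`, and the reward values `r_wait` and `r_cut` are hypothetical parameters.

```python
import numpy as np


def forest_mdp(n_states=3, r_wait=4.0, r_cut=2.0, p_fire=0.1):
    """Build transitions P[action, state, next_state] and rewards R[state, action].

    Action 0 = wait, action 1 = cut. All parameter values are assumptions.
    """
    P = np.zeros((2, n_states, n_states))
    R = np.zeros((n_states, 2))
    for s in range(n_states):
        # Waiting: a fire resets the forest, otherwise it ages by one class.
        P[0, s, 0] += p_fire
        P[0, s, min(s + 1, n_states - 1)] += 1.0 - p_fire
        # Cutting always returns the forest to the youngest class.
        P[1, s, 0] = 1.0
    R[n_states - 1, 0] = r_wait  # reward for keeping an old forest
    R[1:, 1] = 1.0               # cutting a middle-aged forest sells some wood
    R[n_states - 1, 1] = r_cut   # cutting the oldest forest sells more wood
    return P, R


def value_iteration(P, R, gamma=0.9, tol=1e-10):
    """Return the optimal values and a greedy policy for the MDP (P, R)."""
    V = np.zeros(P.shape[1])
    while True:
        # Q[s, a] = R[s, a] + gamma * sum_t P[a, s, t] * V[t]
        Q = R + gamma * np.einsum("ast,t->sa", P, V)
        V_new = Q.max(axis=1)
        if np.max(np.abs(V_new - V)) < tol:
            return V_new, Q.argmax(axis=1)
        V = V_new


P, R = forest_mdp()
forest_values, forest_policy = value_iteration(P, R)
print(forest_values, forest_policy)
```

With these assumed parameters, waiting is preferred in every state; raising `r_cut` or `p_fire` shifts the balance toward cutting.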