Multitask Representation Learning in Linear MDPs

This dataset supports multitask representation learning in linear MDPs. It contains 80 tasks, each with a different destination position, fire configuration, and action deviation probability p.
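
As a rough illustration of how the three per-task parameters might fit together, the sketch below models a task and an action deviation step in Python. Everything in it is an assumption made for illustration: the `Task` container, the `ACTIONS` set, the grid-cell coordinates, and the deviation rule (with probability p the intended action is replaced by a different one chosen uniformly) are hypothetical and are not taken from the dataset itself.

```python
import random
from dataclasses import dataclass
from typing import FrozenSet, Tuple

# Hypothetical action set for a grid world; the dataset's real action space may differ.
ACTIONS = ("up", "down", "left", "right")


@dataclass(frozen=True)
class Task:
    """Hypothetical container for the three per-task parameters."""
    destination: Tuple[int, int]             # goal cell as (row, col)
    fire_cells: FrozenSet[Tuple[int, int]]   # cells occupied by fire
    p: float                                 # action deviation probability


def executed_action(task: Task, intended: str, rng: random.Random) -> str:
    """Assumed rule: with probability p, swap the intended action
    for a different one chosen uniformly at random."""
    if rng.random() < task.p:
        return rng.choice([a for a in ACTIONS if a != intended])
    return intended


# Example: one task with a 10% chance that the agent's action deviates.
task = Task(destination=(3, 3), fire_cells=frozenset({(1, 1), (2, 1)}), p=0.1)
action = executed_action(task, "up", random.Random(0))
```

Under this assumed rule, p = 0 gives fully deterministic control and larger p makes the executed action less reliable, which is one way tasks sharing the same layout could still differ in their dynamics.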

BibTeX: