Solving Robust MDPs through No-Regret Dynamics

doi:doi:10.57702/2ay07f1i

Solving Robust MDPs through No-Regret Dynamics

The Robust MDPs problem is a Markov Decision Process problem where the goal is to find a policy π that maximizes the Value Function under worst-case transition dynamics.

Data and Resources

Original MetadataJSON
The json representation of the dataset with its distributions based on DCAT.
Explore
- Preview
- Download

Cite this as

Etash Kumar Guha, Jason D. Lee (2024). Dataset: Solving Robust MDPs through No-Regret Dynamics. https://doi.org/10.57702/2ay07f1i

DOI retrieved: December 17, 2024

Additional Info

Field	Value
Created	December 17, 2024
Last update	December 17, 2024
Defined In	https://doi.org/10.48550/arXiv.2305.19035
Author	Etash Kumar Guha
More Authors	Jason D. Lee