Solving Robust MDPs through No-Regret Dynamics

A robust MDP is a Markov Decision Process in which the transition dynamics are uncertain; the goal is to find a policy π that maximizes the value function under the worst-case transition dynamics.
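
To make the max-min objective above concrete, here is a minimal sketch in Python. It is not the no-regret method of the cited paper: it simply enumerates every deterministic policy and every candidate transition kernel on a toy problem. The finite uncertainty set, the two-state example, the discount factor, and the names `policy_value` and `robust_best_policy` are all assumptions made for illustration.

```python
from itertools import product

def policy_value(policy, P, R, gamma, start, iters=500):
    """Discounted value of a deterministic policy under one kernel P.

    P[s][a] is a list of next-state probabilities, R[s][a] a reward.
    Uses repeated Bellman backups (an illustrative choice).
    """
    n = len(R)
    V = [0.0] * n
    for _ in range(iters):
        V = [
            R[s][policy[s]]
            + gamma * sum(p * V[t] for t, p in enumerate(P[s][policy[s]]))
            for s in range(n)
        ]
    return V[start]

def robust_best_policy(kernels, R, gamma=0.9, start=0):
    """Brute-force max over policies of the min over candidate kernels."""
    n_states, n_actions = len(R), len(R[0])
    best = None
    for policy in product(range(n_actions), repeat=n_states):
        # Worst case: the adversary picks the kernel that hurts this policy most.
        worst = min(policy_value(policy, P, R, gamma, start) for P in kernels)
        if best is None or worst > best[1]:
            best = (policy, worst)
    return best

# Hypothetical toy problem: state 1 is absorbing with zero reward.
# In state 0, action 0 is "safe" (reward 1, always stays in state 0);
# action 1 is "risky" (reward 2, but kernel_b sends it to state 1).
R = [[1.0, 2.0], [0.0, 0.0]]
kernel_a = [[[1.0, 0.0], [1.0, 0.0]], [[0.0, 1.0], [0.0, 1.0]]]
kernel_b = [[[1.0, 0.0], [0.0, 1.0]], [[0.0, 1.0], [0.0, 1.0]]]

policy, value = robust_best_policy([kernel_a, kernel_b], R)
print(policy, round(value, 6))
```

In this toy problem the risky action looks better under `kernel_a` alone, but its worst-case value is lower, so the robust choice in state 0 is the safe action. Enumeration scales exponentially in the number of states and is only meant to illustrate the objective.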

Cite this as

Etash Kumar Guha, Jason D. Lee (2024). Dataset: Solving Robust MDPs through No-Regret Dynamics. https://doi.org/10.57702/2ay07f1i

DOI retrieved: December 17, 2024

Additional Info

Field Value
Created December 17, 2024
Last update December 17, 2024
Defined In https://doi.org/10.48550/arXiv.2305.19035
Author Etash Kumar Guha
More Authors Jason D. Lee