Policy Optimization for Stochastic Shortest Path

doi:10.57702/tsssfkfz

Policy optimization for the stochastic shortest path (SSP) problem, a goal-oriented reinforcement learning model that strictly generalizes the finite-horizon model and better captures many applications.

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Cite this as

Liyu Chen, Haipeng Luo, Aviv Rosenberg (2024). Dataset: Policy Optimization for Stochastic Shortest Path. https://doi.org/10.57702/tsssfkfz

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2202.03334
Author	Liyu Chen
More Authors	Haipeng Luo, Aviv Rosenberg
Homepage	https://arxiv.org/abs/2112.09859