Stochastic MDP

doi:doi:10.57702/pdj6emd3

Stochastic MDP

Followers: 0

Organization

No Organization

There is no description for this organization

License

No License Provided

Export

DCAT(rdf/xml) DCAT(xml) DCAT(N3) DCAT(ttl) DCAT(jsonld) DataCite CSL DublinCore BibTex

Stochastic MDP

The dataset used in this paper is a stochastic MDP with |S| = 4 and |A| = 4. One of the states is set to the terminal state, and one of the rest is set to the starting state. The transition probability and reward functions are randomly generated.

BibTex:

Before browse our site, please accept our cookies policy