Off-Policy Deep Reinforcement Learning without Exploration

doi:10.57702/vkcnqiqb

The dataset used in the paper is a fixed batch of data that has already been gathered, with no possibility of further data collection.

Data and Resources

Original Metadata (JSON)
The json representation of the dataset with its distributions based on DCAT.

Cite this as

Scott Fujimoto, David Meger, Doina Precup (2024). Dataset: Off-Policy Deep Reinforcement Learning without Exploration. https://doi.org/10.57702/vkcnqiqb

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Author	Scott Fujimoto
More Authors	David Meger, Doina Precup
Homepage	https://arxiv.org/abs/1909.09637