-
Continuous-Time Policy Gradient for Optimisation of Structured Neural Controller
The dataset used in the paper is a continuous-time policy gradient method for optimisation of structured neural controller. -
Smart Containers With Bidding Capacity: A Policy Gradient Algorithm for Semi-...
The dataset used in this paper is a set of instances with attributes such as time till due date, job transport distance, job volume, holding cost, penalty failed job, and... -
OpenAI Gym
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used several continuous control environments from the OpenAI Gym.