Continuous-Time Policy Gradient for Optimisation of Structured Neural Controller

The dataset used in the paper is a continuous-time policy gradient method for optimisation of structured neural controller.

BibTex: