Optimal batch size control for stochastic gradient descent

The dataset used in this paper is a continuous-time stochastic control problem for stochastic gradient descent (SGD) and similar stochastic gradient algorithms.
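
To make the idea of treating the batch size as a control variable concrete, here is a minimal discrete-time sketch. It is not the paper's method: the least-squares problem, the function names `sgd_with_batch_control` and `make_problem`, and the linearly growing batch size rule are all assumptions chosen for illustration only.

```python
import numpy as np


def sgd_with_batch_control(X, y, batch_size_fn, lr=0.05, steps=200, seed=0):
    """Minibatch SGD on least squares where the batch size is chosen per step.

    batch_size_fn(k) is a hypothetical control rule mapping the step index k
    to a batch size; it stands in for whatever policy one wants to study.
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    w = np.zeros(d)
    losses = []
    for k in range(steps):
        # Clamp the requested batch size to the valid range [1, n].
        b = int(np.clip(batch_size_fn(k), 1, n))
        idx = rng.choice(n, size=b, replace=False)
        residual = X[idx] @ w - y[idx]
        grad = X[idx].T @ residual / b  # stochastic gradient estimate
        w -= lr * grad
        # Track the full-data loss to see the effect of the control rule.
        losses.append(0.5 * np.mean((X @ w - y) ** 2))
    return w, losses


def make_problem(n=512, d=5, seed=1):
    """Synthetic linear regression data (an assumption, not the paper's data)."""
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, d))
    w_true = rng.normal(size=d)
    y = X @ w_true + 0.1 * rng.normal(size=n)
    return X, y


X, y = make_problem()
# Illustrative rule: start with small, noisy batches and grow them over time.
w, losses = sgd_with_batch_control(X, y, batch_size_fn=lambda k: 4 + k // 10)
```

Swapping in a different `batch_size_fn` (constant, growing, or state-dependent) is one simple way to compare batch size policies on the same problem.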

BibTeX: