-
Implicit Regularization of SGD with Preconditioning for Least Square Problems
The dataset used in the paper is a least squares regression problem instance. -
Optimal batch size control for stochastic gradient descent
The dataset used in this paper is a continuous-time stochastic control problem for SGD and similar stochastic gradient descent algorithms.