Broken Neural Scaling Laws

A smoothly broken power law functional form that accurately models and extrapolates the scaling behaviors of deep neural networks for various architectures and tasks.

BibTex: