Single Stream Parallelization of Recurrent Neural Networks for Low Power and Fast Inference

Single stream parallelization of recurrent neural networks for low power and fast inference

BibTex: