Text2video-zero: Text-to-image diffusion models are zero-shot video generators

Text2video-zero: Text-to-image diffusion models are zero-shot video generators.

BibTex: