VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation

A text-to-video generation approach, which can generate a high-definition video with high frame fidelity and strong temporal consistency using reference-guided latent diffusion.

BibTex: