VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation

A text-to-video generation approach, which can generate a high-definition video with high frame fidelity and strong temporal consistency using reference-guided latent diffusion.

Data and Resources

Cite this as

Xin Li, Wenqing Chu, Ye Wu, Weihang Yuan, Fanglong Liu, Qi Zhang, Fu Li, Haocheng Feng, Errui Ding, Jingdong Wang (2024). Dataset: VideoGen: A Reference-Guided Latent Diffusion Approach for High Definition Text-to-Video Generation. https://doi.org/10.57702/w7iyampy

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2309.00398
Author Xin Li
More Authors
Wenqing Chu
Ye Wu
Weihang Yuan
Fanglong Liu
Qi Zhang
Fu Li
Haocheng Feng
Errui Ding
Jingdong Wang
Homepage https://github.com/VideoGen/VideoGen