UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models

doi:10.57702/y68gj7ic

Video diffusion models have been developed for video generation and usually integrate text and image conditioning to improve control over the generated content.

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset with its distributions, based on DCAT.

Cite this as

Xuweiyi Chen, Tian Xia, Sihan Xu (2024). Dataset: UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion Models. https://doi.org/10.57702/y68gj7ic

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2403.02332
Author	Xuweiyi Chen
More Authors	Tian Xia, Sihan Xu