-
Video Generation from Text Employing Latent Path Construction for Temporal Mo...
Video generation is one of the most challenging tasks in Machine Learning and Computer Vision fields of study. In this paper, we tackle the text to video generation problem,... -
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion ...
Video Diffusion Models have been developed for video generation, usually integrating text and image conditioning to enhance control over the generated content. -
MSR-VTT and UCF-101
The dataset used in the paper is MSR-VTT and UCF-101, two public datasets for video-text generation. MSR-VTT contains 4,900 videos with 20 manually annotated captions for each... -
MTVG: Multi-text Video Generation with Text-to-Video Models
The authors used the pre-trained diffusion-based text-to-video (T2V) generation model without additional fine-tuning.