Text-to-Video - Groups

Robotic Videos Dataset

The Robotic Videos dataset contains videos and the corresponding user-to-robot textual commands.

Video Generation from Text Employing Latent Path Construction for Temporal Mo...

Video generation is one of the most challenging tasks in the fields of machine learning and computer vision. In this paper, we tackle the text-to-video generation problem,...

UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion ...

Video Diffusion Models have been developed for video generation, usually integrating text and image conditioning to enhance control over the generated content.

MSR-VTT and UCF-101

The datasets used in the paper are MSR-VTT and UCF-101, two public datasets for video-text generation. MSR-VTT contains 4,900 videos with 20 manually annotated captions for each...

ModelScope text-to-video

The dataset used in the paper for text-to-video diffusion models.

MTVG: Multi-text Video Generation with Text-to-Video Models

The authors used the pre-trained diffusion-based text-to-video (T2V) generation model without additional fine-tuning.
