-
Videofusion: Decomposed diffusion models for high-quality video generation
Videofusion: Decomposed diffusion models for high-quality video generation. -
CTRL-Adapter
A framework for adding diverse controls to any image/video diffusion model -
Videocrafter2: Overcoming data limitations for high-quality video diffusion m...
The dataset used in this paper for training and testing the Videocrafter2 model. -
Stable video diffusion: Scaling latent video diffusion models to large datasets
The dataset used in this paper for training and testing the SVD model. -
COCO2017-Val
The dataset used in this paper for training and testing the CV-VAE model. -
Kinetics-400 and Kinetics-600
The Kinetics-400 and Kinetics-600 datasets are video understanding datasets used for learning rich and multi-scale spatiotemporal semantics from high-dimensional videos. -
Kinetics-600
The Kinetics-600 dataset consists of 392k training videos and 30k validation videos in 600 human action categories. -
MotionMaster
The dataset used in the paper is a video dataset containing videos with different camera motions. -
CATER-GENs
Generating coherent and natural movement is the key challenge in video generation. This research proposes to condense video generation into a problem of motion generation, to... -
BAIR Robot Pushing
Generating coherent and natural movement is the key challenge in video generation. This research proposes to condense video generation into a problem of motion generation, to... -
DynamiCrafter
The DynamiCrafter dataset is a collection of open-domain images used for animating images with video diffusion priors. -
Make-Your-Anchor
A diffusion-based 2D avatar generation framework to produce realistic and high-quality anchor-style human videos. -
MTVG: Multi-text Video Generation with Text-to-Video Models
The authors used the pre-trained diffusion-based text-to-video (T2V) generation model without additional fine-tuning. -
UCF101 dataset
The UCF101 dataset is used to test the proposed text-to-video model. The dataset contains 101 action categories, and each category has 10 videos. The videos are labeled with text... -
Generated Video Dataset (GVD)
A large-scale generated video benchmark dataset for network training and evaluation, comprising synthetic videos from 11 different generator models.