InternVid-14M-aesthetics
The dataset used in the paper is InternVid-14M-aesthetics, a subset of InternVid-14M filtered for aesthetic quality and used to avoid watermarks in generated videos.
Video Generation from Text Employing Latent Path Construction for Temporal Mo...
Video generation is one of the most challenging tasks in machine learning and computer vision. In this paper, we tackle the text-to-video generation problem,...
Video Generative Patch Nearest Neighbors (VGPNN)
A non-parametric approach for video generation from a single video, outperforming single-video GANs in visual quality and realism.
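To make "patch nearest neighbors" concrete, here is a minimal, illustrative sketch in the spirit of such non-parametric methods, not the authors' implementation: spatio-temporal patches of a perturbed query video are replaced by their nearest patches from the single source video, and overlapping patches are averaged back into frames. The function names, patch sizes, and brute-force search are assumptions made for illustration.

```python
import numpy as np

def extract_patches(video, pt=3, ps=5, stride=2):
    # video: (T, H, W) grayscale array; returns flattened 3D patches and their top-left coords.
    T, H, W = video.shape
    patches, coords = [], []
    for t in range(0, T - pt + 1, stride):
        for y in range(0, H - ps + 1, stride):
            for x in range(0, W - ps + 1, stride):
                patches.append(video[t:t + pt, y:y + ps, x:x + ps].ravel())
                coords.append((t, y, x))
    return np.stack(patches), coords

def pnn_resynthesize(source, query, pt=3, ps=5, stride=2):
    # Replace each query patch with its nearest source patch (L2 distance),
    # then average overlapping patches back into a video of the query's shape.
    src_patches, _ = extract_patches(source, pt, ps, stride)
    qry_patches, coords = extract_patches(query, pt, ps, stride)
    out = np.zeros(query.shape, dtype=np.float64)
    weight = np.zeros(query.shape, dtype=np.float64)
    for q, (t, y, x) in zip(qry_patches, coords):
        d = ((src_patches - q) ** 2).sum(axis=1)      # brute-force nearest-neighbour search
        best = src_patches[np.argmin(d)].reshape(pt, ps, ps)
        out[t:t + pt, y:y + ps, x:x + ps] += best
        weight[t:t + pt, y:y + ps, x:x + ps] += 1.0
    return out / np.maximum(weight, 1e-8)

# Usage: perturb the single source video and resynthesize it to obtain a new variant.
src = np.random.rand(8, 32, 32)                       # stand-in for a real video
qry = src + 0.1 * np.random.randn(*src.shape)
variant = pnn_resynthesize(src, qry)
```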
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion ...
Video Diffusion Models have been developed for video generation, usually integrating text and image conditioning to enhance control over the generated content.
Mora: Enabling Generalist Video Generation via a Multi-Agent Framework
A multi-agent framework that enables generalist video generation.
DreamVideo: Composing Your Dream Videos with Customized Subject and Motion
Customized generation using diffusion models has made impressive progress in image generation, but remains unsatisfactory in the challenging video generation task, as it...
S2DM: Sector-Shaped Diffusion Models for Video Generation
Diffusion models have achieved great success in image generation. However, when leveraging this idea for video generation, we face significant challenges in maintaining the...
StyleVideoGAN
StyleVideoGAN: A Temporal Generative Model using a Pretrained StyleGAN
MOFA-Video
MOFA-Video is a controllable image animation method that generates video from a given image using various additional controllable signals.
MSR-VTT and UCF-101
The datasets used in the paper are MSR-VTT and UCF-101, two public benchmarks for text-to-video generation. MSR-VTT contains 4,900 videos with 20 manually annotated captions for each...
SoloDance Dataset
The SoloDance dataset contains 179 solo dance videos in real scenes, collected online.
iPER Dataset
The iPER dataset, proposed by [25], was collected in a laboratory environment.
REMOT: A Region-to-Whole Framework for Realistic Human Motion Transfer
Human Video Motion Transfer (HVMT) aims to generate, given an image of a source person, a video of that person imitating the motion of a driving person.
Events-to-Video: Bringing Modern Computer Vision to Event Cameras
E2VID is a pipeline that reconstructs video sequences from event-camera data.
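E2VID itself is a learned, recurrent reconstruction network; the sketch below only illustrates what the input event data looks like by implementing the naive event-accumulation baseline that such methods improve upon. The (t, x, y, polarity) tuple layout and all names are assumptions for illustration.

```python
import numpy as np

def accumulate_events(events, height, width, t_start, t_end, n_frames):
    # events: array of (t, x, y, polarity) rows, with polarity in {-1, +1}.
    # Each output frame sums the polarities of events falling into its time bin.
    frames = np.zeros((n_frames, height, width), dtype=np.float32)
    edges = np.linspace(t_start, t_end, n_frames + 1)
    for t, x, y, p in events:
        k = np.searchsorted(edges, t, side="right") - 1
        if 0 <= k < n_frames:
            frames[k, int(y), int(x)] += p
    return frames

# Usage with synthetic events (timestamps, pixel coordinates, random polarities).
rng = np.random.default_rng(0)
events = np.stack([rng.uniform(0.0, 1.0, 1000),
                   rng.integers(0, 64, 1000),
                   rng.integers(0, 48, 1000),
                   rng.choice([-1.0, 1.0], 1000)], axis=1)
frames = accumulate_events(events, height=48, width=64, t_start=0.0, t_end=1.0, n_frames=10)
```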
Sky Time-lapse
A dataset of sky time-lapse videos used for video generation.