VideoMap: Supporting Video Editing Exploration, Brainstorming, and Prototyping in the Latent Space
VideoMap is a proof-of-concept video editing interface that operates on video frames projected onto a latent space, enabling users to visually uncover patterns and relationships.
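To make the latent-space idea concrete, the minimal sketch below embeds sampled frames with a generic pretrained image backbone and projects them to 2-D so related frames cluster visually; the backbone (ResNet-18) and the projection method (t-SNE) are illustrative assumptions, not VideoMap's actual pipeline.

```python
import torch
import torchvision.models as models
import torchvision.transforms as T
from sklearn.manifold import TSNE
import matplotlib.pyplot as plt

def embed_frames(frames):
    """frames: list of PIL images sampled from one or more videos."""
    backbone = models.resnet18(weights=models.ResNet18_Weights.DEFAULT)
    backbone.fc = torch.nn.Identity()          # keep the 512-d penultimate features
    backbone.eval()
    preprocess = T.Compose([T.Resize(256), T.CenterCrop(224), T.ToTensor()])
    batch = torch.stack([preprocess(f) for f in frames])
    with torch.no_grad():
        return backbone(batch)                 # (N, 512) frame embeddings

def plot_video_map(frames):
    feats = embed_frames(frames).numpy()
    # Project to 2-D so frames with similar content land near each other.
    xy = TSNE(n_components=2, perplexity=min(30, len(frames) - 1)).fit_transform(feats)
    plt.scatter(xy[:, 0], xy[:, 1], s=12)      # each point is one frame
    plt.title("Video frames projected onto a 2-D latent map")
    plt.show()
```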
InstructVid2Vid dataset
The dataset used to train the InstructVid2Vid model, consisting of (video, instruction, edited video) triplets.
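A minimal sketch of how one such triplet might be represented; the field names are assumptions for illustration, not the dataset's actual schema.

```python
from dataclasses import dataclass

@dataclass
class EditTriplet:
    source_video: str   # path to the original clip
    instruction: str    # natural-language editing instruction
    edited_video: str   # path to the clip after the edit is applied

sample = EditTriplet(
    source_video="clips/0001.mp4",
    instruction="make the sky look like a sunset",
    edited_video="clips/0001_edited.mp4",
)
```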
MotionFollower
The paper does not explicitly describe its training dataset; the authors mention collecting 3K videos (60-90 seconds long) from the internet to train their model.
MaskINT: Video Editing via Interpolative Non-autoregressive Masked Transformers
Text-based video editing using MaskINT, a two-stage pipeline involving keyframe joint editing and structure-aware frame interpolation.
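The toy sketch below illustrates only the non-autoregressive, confidence-scheduled infilling idea (in the spirit of MaskGIT-style decoding): every masked frame is predicted in parallel each round, and only the most confident predictions are committed. The blend-based predictor and confidence score are stand-ins; the actual method uses a learned masked transformer over tokenized frames.

```python
import numpy as np

def masked_interpolate(key_start, key_end, num_frames, rounds=3):
    """Fill the frames between two edited keyframe arrays, a few at a time."""
    frames = np.zeros((num_frames,) + key_start.shape)
    known = np.zeros(num_frames, dtype=bool)
    frames[0], frames[-1] = key_start, key_end
    known[0] = known[-1] = True

    def predict(t):                      # stand-in "model": blend the keyframes
        w = t / (num_frames - 1)
        return (1 - w) * key_start + w * key_end, 1 - abs(0.5 - w)

    for _ in range(rounds):
        masked = np.flatnonzero(~known)
        if masked.size == 0:
            break
        # Non-autoregressive step: predict every masked frame in parallel,
        # then commit only the most confident half this round.
        preds = {t: predict(t) for t in masked}
        keep = sorted(preds, key=lambda t: -preds[t][1])[: max(1, len(preds) // 2)]
        for t in keep:
            frames[t], known[t] = preds[t][0], True

    for t in np.flatnonzero(~known):     # final pass: commit any leftovers
        frames[t] = predict(t)[0]
    return frames
```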
Emu Video Edit Training Dataset
The Emu Video Edit model's training dataset, containing 1600 videos with 7 editing instructions each.
EVE: Efficient zero-shot text-based Video Editing
Zero-shot text-based video editing with depth-map guidance and temporal consistency constraints.
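A minimal sketch of a temporal consistency constraint of this kind, assuming the edited frames are optimized in a latent space; the paper's exact loss may differ.

```python
import torch

def temporal_consistency_loss(latents: torch.Tensor) -> torch.Tensor:
    """latents: (T, C, H, W) latent codes for T consecutive edited frames."""
    # Penalize frame-to-frame differences so adjacent frames do not flicker.
    diffs = latents[1:] - latents[:-1]
    return diffs.pow(2).mean()

# Usage: add the penalty to the editing objective during optimization, e.g.
# loss = edit_loss + 0.1 * temporal_consistency_loss(frame_latents)
```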
Make-a-protagonist: Generic video editing with an ensemble of experts
A generic video editing framework that performs edits by coordinating an ensemble of expert models.
Zero-shot video editing using off-the-shelf image diffusion models
Performs video editing zero-shot with pre-trained, off-the-shelf image diffusion models, requiring no task-specific training.
ControlVideo
ControlVideo is a general framework for utilizing T2I diffusion models for one-shot video editing; it incorporates additional conditions such as edge maps, the key frame, and...
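A small sketch of preparing the edge-map condition for one frame, assuming OpenCV's Canny detector as the edge extractor; the diffusion model call itself is elided.

```python
import cv2
import numpy as np

def edge_condition(frame_bgr: np.ndarray, low: int = 100, high: int = 200) -> np.ndarray:
    """Turn one video frame into an edge-map condition for the T2I model."""
    gray = cv2.cvtColor(frame_bgr, cv2.COLOR_BGR2GRAY)
    edges = cv2.Canny(gray, low, high)               # (H, W) uint8 edge map
    return np.repeat(edges[:, :, None], 3, axis=2)   # 3-channel map, ControlNet-style
```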
Tune-A-Video
The dataset of video examples used in the Tune-A-Video paper for its video editing tasks.
EI2 model for text-driven video editing
The paper does not explicitly describe a dedicated dataset; the authors mention using the DAVIS dataset and gathering face videos from the Pexels website.
DAVIS and WebVid datasets
The paper does not explicitly describe its dataset; the authors mention using 26 text-video pairs from the public DAVIS and WebVid datasets.