Dataset - LDM

Sub-URMP

A high-resolution landscape video dataset with audio-visual pairs for sound-guided video generation task.
- Dataset
- JSON
MEAD and HDTF datasets

MEAD and HDTF datasets are used for training and testing the proposed SAAS model.
- Dataset
- JSON
VGGSound

The VGGSound dataset is a large-scale audio-visual dataset containing 10,000 10-second video clips with corresponding audio files.
- Dataset
- JSON
DFDC

Face forgery by deepfake is widely spread over the internet and has raised severe societal concerns. Recently, how to detect such forgery contents has become a hot research...
- Dataset
- JSON
HDTF

The dataset used in the paper for 3D head avatar reconstruction from monocular RGB videos.
- Dataset
- JSON
MEAD

The MEAD dataset is a large-scale, high-quality emotional audio-visual dataset, which consists of 60 actors, including 8 basic emotions and 3 different emotional-intensity...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

6 datasets found