6 datasets found

Tags: Audio-Visual

  • Sub-URMP

    A high-resolution landscape video dataset with audio-visual pairs for the sound-guided video generation task.
  • MEAD and HDTF datasets

    The MEAD and HDTF datasets are used to train and test the proposed SAAS model.
  • VGGSound

    The VGGSound dataset is a large-scale audio-visual dataset containing more than 200,000 10-second video clips with corresponding audio.
  • DFDC

    Face forgery by deepfake is widely spread over the internet and has raised severe societal concerns. Recently, how to detect such forgery contents has become a hot research...
  • HDTF

    A high-definition talking-face video dataset, used here for 3D head avatar reconstruction from monocular RGB videos.
  • MEAD

    The MEAD dataset is a large-scale, high-quality emotional audio-visual dataset, which consists of 60 actors, including 8 basic emotions and 3 different emotional-intensity...
You can also access this registry using the API (see API Docs).