2 datasets found

Tags: Speech-Driven

Filter Results
  • VOCASET

    The VOCASET dataset contains 480 paired audio and 3D facial motion sequences captured from 12 subjects.
  • BIWI

    BIWI dataset has total of 15,678 frames from 24 videos with 20 different subjects captured in the controlled indoor environment.