SoundNet

The dataset is used to learn general and effective models for both audio and video analysis through self-supervised temporal synchronization.

BibTeX: