RWAVS

Real-world Audio-Visual Scene (RWAVS) dataset offers realistic multi-modal training samples constituting camera poses, high-quality binaural audios, and images.

BibTex: