AVSBench

Audio-visual segmentation (AVS) aims to segment the sound sources in a video sequence, which requires a pixel-level understanding of audio-visual correspondence.

Cite this as

Juhyeong Seon, Woobin Im, Sebin Lee, Jumin Lee, Sung-Eui Yoon (2024). Dataset: AVSBench. https://doi.org/10.57702/n0dvi342

DOI retrieved: December 16, 2024

Additional Info

Field          Value
Created        December 16, 2024
Last update    December 16, 2024
Defined In     https://doi.org/10.48550/arXiv.2403.11074
Citation       https://doi.org/10.48550/arXiv.2406.06163
               https://doi.org/10.48550/arXiv.2311.04066
Author         Juhyeong Seon
More Authors   Woobin Im, Sebin Lee, Jumin Lee, Sung-Eui Yoon
Homepage       https://arxiv.org/abs/2209.07929