AVSBench

doi:10.57702/n0dvi342

Audio-visual segmentation (AVS) aims to segment the sound sources in a video sequence, which requires a pixel-level understanding of audio-visual correspondence.

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset and its distributions, based on DCAT.

Cite this as

Juhyeong Seon, Woobin Im, Sebin Lee, Jumin Lee, Sung-Eui Yoon (2024). Dataset: AVSBench. https://doi.org/10.57702/n0dvi342

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2403.11074
Citation	https://doi.org/10.48550/arXiv.2406.06163 https://doi.org/10.48550/arXiv.2311.04066
Author	Juhyeong Seon
More Authors	Woobin Im, Sebin Lee, Jumin Lee, Sung-Eui Yoon
Homepage	https://arxiv.org/abs/2209.07929