2 datasets found

Tags: pixel-wise annotations

Filter Results
  • AVSBench

    Audio-visual segmentation (AVS) aims to segment sound sources in the video sequence, requiring a pixel-level understanding of audio-visual correspondence.
  • Cityscapes

    The Cityscapes dataset is a large and famous city street scene semantic segmentation dataset. 19 classes of which 30 classes of this dataset are considered for training and...
You can also access this registry using the API (see API Docs).