Dataset - LDM

AVSBench

Audio-visual segmentation (AVS) aims to segment the sound sources in a video sequence, which requires a pixel-level understanding of audio-visual correspondence.
- Dataset
- JSON

Cityscapes

The Cityscapes dataset is a large, well-known semantic segmentation dataset of urban street scenes. Of the dataset's 30 classes, 19 are considered for training and...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

2 datasets found