The VSPW dataset is a large-scale dataset for video scene parsing in the wild, containing 2,806 videos with 480x853 pixel frames and 124 semantic categories.
The Cityscapes dataset is a large, widely used semantic segmentation dataset of urban street scenes. Of the 30 classes in this dataset, 19 are considered for training and...