-
Densely Annotated Video Segmentation (DAVIS)
The Davis dataset contains fifty high-resolution videos with pixel-accurate ground truth. -
Video Segmentation Datasets
The dataset used for video segmentation, including Moving MNIST, Change detection, Segtrack V2, and Davis datasets. -
A Simple Video Segmenter by Tracking Objects Along Axial Trajectories
Video segmentation requires consistently segmenting and tracking objects over time. Due to the quadratic dependency on input size, directly applying self-attention to video... -
INRIA YouTube Instructional Videos
The INRIA YouTube Instructional Videos dataset contains five tasks of different instructional domains: “making coffee”, “changing a car tire”, “CPR”, “jumping a car”, and... -
Breakfast Actions
The Breakfast Actions dataset contains 70 hours of cooking activities of varying complexity. It contains 10 different cooking tasks (with about 170 videos per task), which can... -
Youtube-Objects
Video object segmentation is challenging due to the factors like rapidly fast motion, cluttered backgrounds, arbitrary object appearance variation and shape deformation. -
DAVIS-17 Dataset
The DAVIS-17 dataset, which is a benchmark for video object segmentation. -
OMG-Seg Dataset
The dataset used for training and testing the OMG-Seg model, which includes COCO panoptic, COCO-SAM, VIPSeg, Youtube-VIS-2019, and Youtube-VIS-2021 datasets. -
Youtube-VOS
The Youtube-VOS dataset is a large-scale benchmark for multi-object video object segmentation, containing 120 videos with dense annotation.