YouTube-VOS

doi:10.57702/qzdb98hv

One-shot Video Object Segmentation (VOS) is the task of tracking an object of interest at the pixel level throughout a video sequence, where the segmentation mask of the first frame is given at inference time.
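
To make the input/output contract of the task concrete, the following is a minimal sketch and not a method from the dataset's authors. It assumes masks are plain nested lists of 0/1 values and uses a deliberately trivial baseline that reuses the first-frame mask for every frame. The names `propagate_first_mask` and `iou` are hypothetical, and intersection-over-union is used here only as a common overlap measure for illustration.

```python
from typing import List

# Assumption: a mask is a 2D grid of ints, 1 = object pixel, 0 = background.
Mask = List[List[int]]


def propagate_first_mask(num_frames: int, first_mask: Mask) -> List[Mask]:
    """Trivial baseline: copy the given first-frame mask to every frame."""
    return [[row[:] for row in first_mask] for _ in range(num_frames)]


def iou(pred: Mask, truth: Mask) -> float:
    """Intersection-over-union between two binary masks of equal shape."""
    inter = sum(p & t for pr, tr in zip(pred, truth) for p, t in zip(pr, tr))
    union = sum(p | t for pr, tr in zip(pred, truth) for p, t in zip(pr, tr))
    # Two empty masks are treated as a perfect match.
    return inter / union if union else 1.0


# Toy two-frame "video": the object shifts one pixel to the right.
first_mask = [[1, 1, 0, 0]]
ground_truth = [[[1, 1, 0, 0]], [[0, 1, 1, 0]]]

predictions = propagate_first_mask(len(ground_truth), first_mask)
scores = [iou(p, t) for p, t in zip(predictions, ground_truth)]
```

In this toy case the baseline matches the first frame exactly and loses overlap as soon as the object moves, which is the gap an actual VOS model is meant to close.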

Data and Resources

Original Metadata (JSON)
The JSON representation of the dataset with its distributions based on DCAT.

Cite this as

N. Xu, L. Yang, Y. Fan, D. Yue, Y. Liang, J. Yang, T. Huang (2024). Dataset: YouTube-VOS. https://doi.org/10.57702/qzdb98hv

DOI retrieved: December 2, 2024

Additional Info

Field	Value
Created	December 2, 2024
Last update	December 2, 2024
Defined In	https://doi.org/10.1109/TCSVT.2023.3341170
Citation	https://doi.org/10.48550/arXiv.2208.10662, https://doi.org/10.48550/arXiv.2103.12934, https://doi.org/10.48550/arXiv.2112.02853, https://doi.org/10.48550/arXiv.2304.06718, https://doi.org/10.48550/arXiv.2010.05069
Author	N. Xu
More Authors	L. Yang, Y. Fan, D. Yue, Y. Liang, J. Yang, T. Huang
Homepage	https://www.youtube-vos.org/