YouTube-VOS

One-shot Video Object Segmentation (VOS) is the task of tracking an object of interest pixel by pixel throughout a video sequence, given the object's segmentation mask for the first frame at inference time.
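
To make the task's input/output contract concrete, here is a minimal sketch in Python. It is not a method from the YouTube-VOS authors: the function name `propagate_first_mask`, the nested-list frame and mask types, and the grayscale toy frames are all assumptions chosen for illustration. The "model" is a deliberately trivial baseline that reuses the first-frame mask for every frame; a real one-shot VOS method would replace that step with learned propagation.

```python
from typing import List

# Hypothetical, simplified types for illustration only:
# a frame is an H x W grid of grayscale intensities,
# a mask is an H x W grid of 0/1 labels (1 = object of interest).
Frame = List[List[int]]
Mask = List[List[int]]


def propagate_first_mask(frames: List[Frame], first_mask: Mask) -> List[Mask]:
    """Return one mask per frame, given only the first-frame mask.

    Trivial baseline: the first-frame mask is copied to every frame.
    This only demonstrates the one-shot VOS interface, not a real tracker.
    """
    if not frames:
        return []
    height, width = len(frames[0]), len(frames[0][0])
    if len(first_mask) != height or any(len(row) != width for row in first_mask):
        raise ValueError("first_mask must match the frame resolution")
    # Copy the rows so each output mask is independent of the input mask.
    return [[row[:] for row in first_mask] for _ in frames]


# Toy usage: three 2x3 frames and a first-frame mask marking the object.
frames = [[[0, 0, 0], [0, 0, 0]] for _ in range(3)]
first_mask = [[0, 1, 1], [0, 1, 0]]
masks = propagate_first_mask(frames, first_mask)
```

The point of the sketch is the signature: the video frames and a single annotated mask go in, and a per-frame mask sequence of the same resolution comes out.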

Cite this as

N. Xu, L. Yang, Y. Fan, D. Yue, Y. Liang, J. Yang, T. Huang (2024). Dataset: YouTube-VOS. https://doi.org/10.57702/qzdb98hv

DOI retrieved: December 2, 2024

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Defined In https://doi.org/10.1109/TCSVT.2023.3341170
Citation
  • https://doi.org/10.48550/arXiv.2208.10662
  • https://doi.org/10.48550/arXiv.2103.12934
  • https://doi.org/10.48550/arXiv.2112.02853
  • https://doi.org/10.48550/arXiv.2304.06718
  • https://doi.org/10.48550/arXiv.2010.05069
Author N. Xu
More Authors L. Yang, Y. Fan, D. Yue, Y. Liang, J. Yang, T. Huang
Homepage https://www.youtube-vos.org/