Dataset - LDM

RVOS-D

RVOS-D provides more complex language descriptions from a broader object categories within relatively longer video sequences.
- Dataset
- JSON
Ref-DAVIS17

Ref-DAVIS17 is an extension of the DAVIS17 dataset, where it enhances the dataset by providing language descriptions for each specific object present in the videos.
- Dataset
- JSON
RefSAM: Efficiently Adapting Segmenting Anything Model for Referring Video Ob...

Referring video object segmentation (RVOS) aims to accurately segment the target object in the video with the guidance of given language expressions.
- Dataset
- JSON
Ref-Youtube-VOS

Ref-Youtube-VOS is an extensive referring video object segmentation dataset that comprises approximately 15,000 referring expressions associated with more than 3,900 videos.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found