-
RefCOCO+ and RefCOCOg
The RefCOCO+ and RefCOCOg datasets are benchmarks for referring expression comprehension. They contain images of objects and natural language descriptions of the objects. -
URVOS: Unified referring video object segmentation network with a large-scale ...
URVOS: Unified referring video object segmentation network with a large-scale benchmark. -
RefVOS: a closer look at referring expressions for video object segmentation
RefVOS: a closer look at referring expressions for video object segmentation.