Visual Semantic Role Labeling

The V-COCO dataset is a large-scale visual semantic graph dataset, containing 4,946 images annotated with object and predicate labels to form a scene graph.

BibTex: