-
ShapeNet Annotated with Referring Expressions (SNARE)
A benchmark dataset for grounding natural language referring expressions to distinguish 3D objects. -
WIDER Dataset
A benchmark dataset for face detection, with 32,203 images and 393,703 faces.