Object Description - Groups

RefCOCO+ and RefCOCOg

The RefCOCO+ and RefCOCOg datasets are benchmarks for referring expression comprehension. They contain images of objects and natural language descriptions of the objects.

Dataset
JSON

Cap3D Objaverse

Cap3D Objaverse is a dataset of 660K 3D-text pairs, created using an automated captioning process.

Dataset
JSON

Text2Shape

Text2Shape is a dataset of 8,447 table instances and 6,591 chair instances from the ShapeNet dataset, along with 75,344 natural language descriptions.

Dataset
JSON

ScanRefer

ScanRefer is a dataset of 51,583 referring descriptions of 11,046 objects from 800 ScanNet scenes.

Dataset
JSON

4 datasets found

RefCOCO+ and RefCOCOg

Cap3D Objaverse

Text2Shape

ScanRefer