1 dataset found
Groups: Vision-Language Pre-training

COCO Captions
Object detection is a fundamental task in computer vision, requiring large annotated datasets that are difficult to collect.
Dataset JSON