1 dataset found

Groups: Vision-Language Pre-training Organizations: No Organization

Filter Results
  • COCO Captions

    Object detection is a fundamental task in computer vision, requiring large annotated datasets that are difficult to collect.