2 datasets found

Tags: multimodal matching

  • Flickr8K

    The Flickr8K dataset is a collection of 8,000 images, each paired with 5 sentences describing its content.
  • Microsoft COCO

    The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and...
You can also access this registry using the API (see API Docs).