27 datasets found

Filter Results
  • AWA2

    AWA2 is an animal dataset containing 37,322 images from 50 classes, with 85 attributes provided by experts to describe the semantic feature of each class.
  • CIRR

    CIRR is a general image dataset that comprises 36,554 triplets derived from 21,552 images from the popular natural language inference dataset NLVR2.
  • FashionIQ

    The FashionIQ dataset contains images of fashion products over 3 categories: Dress, Toptee, and Shirt, with 46,609 images in the training and 31,075 images in the validation set.
  • Imagenette Dataset

    The Imagenette dataset is a zero-shot image classification dataset, containing 13,394 images from ten easily separable classes in ImageNet.
  • C-GQA

    The C-GQA dataset is a large-scale dataset for compositional zero-shot learning, containing 413 attributes and 674 objects.
  • CLEVR

    CLEVR images contain objects characterized by a set of attributes (shape, color, size and material). The questions are grouped into 5 categories: Exist, Count, CompareInteger,...
  • MeaCap: Memory-Augmented Zero-shot Image Captioning

    Zero-shot image captioning without well-paired image-text data can be divided into two categories, training-free and text-only-training. Generally, these two types of methods...