-
QMUL-Chair-V2
Fine-grained sketch-based image retrieval (FG-SBIR) aims to minimize the distance between sketches and corresponding images in the embedding space. -
Clothes-V1
Fine-grained sketch-based image retrieval (FG-SBIR) aims to minimize the distance between sketches and corresponding images in the embedding space. -
Products-10K
The Products-10K dataset is a large-scale image retrieval dataset, containing images of products from an e-commerce website. -
Google Landmarks 2020 Dataset
The Google Landmarks 2020 Dataset is a large-scale image retrieval dataset, containing images of landmarks from around the world. -
GUIE Challenge
The Google Universal Image Embedding (GUIE) Challenge dataset is a large-scale image retrieval dataset, covering a wide distribution of objects: landmarks, artwork, food, etc. -
Visual Concept Search
A dataset for visual concept search, where the goal is to identify images containing relevant content based on visual concepts. -
Cambridge Landmarks
The Cambridge Landmarks dataset contains 5 different large outdoor scenes of landmarks in the city of Cambridge. -
ZeroSearch Dataset
A custom dataset to simulate a user's image directory for testing the ZeroSearch algorithm. -
Frozen in Time: A Joint Video and Image Encoder for End-to-End Retrieval
Frozen in time: A joint video and image encoder for end-to-end retrieval. -
Birds-to-Words
The Birds-to-Words dataset contains 15,931 images (12,770 training and 3,151 testing) tagged with descriptions of fine-grained differences between pairwise bird images. -
Oxford5k and Paris6k
Oxford5k and Paris6k are large-scale image retrieval datasets. -
CUB200-2011
The dataset used in the paper is CUB200-2011, a fine-grained image classification dataset. -
LabelMe dataset
The LabelMe dataset is a natural scene dataset used for testing the performance of the IBTM model on image classification tasks.