-
Colonoscopy Coverage Revisited: Identifying Scanning Gaps in Real-Time
A dataset of 250 colonoscopy procedure videos, annotated with gaps where local coverage is deficient. -
Office-Caltech dataset
The Office-Caltech dataset contains images from four domains: Amazon, DSLR, Webcam, and Caltech10. -
HPatches (HP) dataset
The HPatches (HP) dataset is a collection of local image patches used to evaluate local feature descriptors and image retrieval. -
Phototourism (PT) dataset and HPatches (HP) dataset
The paper uses two datasets: Phototourism (PT) and HPatches (HP). -
Multimodal Convolutional Neural Networks for Matching Image and Sentence
Multimodal convolutional neural networks for matching image and sentence -
Fashion IQ
Fashion IQ is a dataset for research on natural-language-based image retrieval systems in the detail-critical fashion domain. -
FashionGen [61] and FashionIQ [83]
FashionGen [61] is used for XMR, SCR, and FIC; FashionIQ [83] is used for TGIR. -
Archive of Many Outdoor Scenes (AMOS)
A dataset for evaluating the performance of image tampering detection methods. -
BigEarthNet-MM
A large-scale benchmark archive for remote sensing image classification and retrieval. -
Image Retrieval Dataset
The dataset used in this paper is a collection of images from different classes, including Africa, Monuments, Animals, and People. -
A Deep Hashing Learning Network
The proposed method is evaluated on two benchmark datasets containing different kinds of images: MNIST and CIFAR-10. -
ICDAR 2019 Competition on Image Retrieval for Historical Handwritten Documents
The ICDAR 2019 Competition on Image Retrieval for Historical Handwritten Documents dataset consists of 20,000 document images representing about 10,000 writers, divided into... -
ImageCLEF corpus
The dataset used in this study is the ImageCLEF corpus. -
MIRFLICKR-25K
The MIRFLICKR-25K dataset consists of 25,015 images and 223,635 tags; each image is associated with several textual tags and annotated with a 24-dimensional semantic label. -
Wide-area image geolocalization with aerial reference imagery
The CVUSA and CVACT datasets are used for cross-view geolocalization. The VIGOR dataset is used for cross-view image retrieval and 3-DoF pose estimation. -
C-BEV: Contrastive Bird’s Eye View Training for Cross-View Image Retrieval an...
The CVUSA and CVACT datasets are used for cross-view geolocalization. The VIGOR dataset is used for cross-view image retrieval and 3-DoF pose estimation.