-
Visual Recognition Task
The dataset used in the paper is not explicitly described, but it is mentioned that it is a visual recognition task. -
Contextual Convolution
Contextual convolution (CoConv) for visual recognition. CoConv is a direct replacement of the standard convolution that can be used at any stage in CNN architectures. -
ImageNet Large Scale Visual Recognition Challenge (ILSVRC)
The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset is a large-scale image classification dataset containing over 14 million images from 21,841 categories. -
Visual Recognition Tasks
The dataset used in the paper is a set of visual recognition tasks, spanning both the data-abundant and data-limited regimes. -
ImageNet Large Scale Visual Recognition Challenge 2012
This dataset is used to evaluate the performance of a Convolutional Neural Network (CNN) on the ImageNet Large Scale Visual Recognition Challenge (ILSVRC2012). -
Visual and Semantic Similarity in ImageNet
This dataset is used to evaluate the performance of a Convolutional Neural Network (CNN) on the ImageNet Large Scale Visual Recognition Challenge (ILSVRC2012). -
MiniImageNet, Caltech-UCSD Birds 200-2011, TieredImageNet, OfficeHome
MiniImageNet, Caltech-UCSD Birds 200-2011, TieredImageNet, OfficeHome -
Cap2Aug: Caption guided Image to Image data Augmentation
Visual recognition in a low-data regime is challenging and often prone to overfitting. To mitigate this issue, several data augmentation strategies have been proposed. However,... -
Deep High-Resolution Representation Learning for Visual Recognition
The Deep High-Resolution Representation Learning for Visual Recognition dataset. -
ImageNet 2012 Large-Scale Visual Recognition Challenge
The dataset used in the paper is the ImageNet 2012 Large-Scale Visual Recognition Challenge dataset. -
OfficeHome dataset
The OfficeHome dataset is a benchmark for domain generalization, containing 13,000 images from 6 domains: Clipart, Product, Real World, Art, Product, and Real World. -
iNaturalist2018
The iNaturalist2018 dataset is the largest dataset for long-tailed visual recognition. -
Microsoft COCO
The Microsoft COCO dataset was used for training and evaluating the CNNs because it has become a standard benchmark for testing algorithms aimed at scene understanding and... -
ImageNet Large Scale Visual Recognition Challenge
A benchmark for low-shot recognition was proposed by Hariharan & Girshick (2017) and consists of a representation learning phase without access to the low-shot classes and a...