14 datasets found

Tags: Computer Vision

Filter Results
  • DenseNet

    The DenseNet dataset is a dataset for image recognition on handwritten digits composed of 60,000 training data and 10,000 test images.
  • ImageNet Large Scale Visual Recognition Challenge (ILSVRC)

    The ImageNet Large Scale Visual Recognition Challenge (ILSVRC) dataset is a large-scale image classification dataset containing over 14 million images from 21,841 categories.
  • PASCAL VOC Dataset

    The PASCAL VOC dataset contains 20 classes, including person, animal, vehicle, and indoor, with 9,963 images containing 24,640 annotated objects.
  • MIT Indoor Scene Recognition

    The MIT Indoor Scene Recognition dataset contains 67 categories of indoor scenes.
  • ImageNet: A Large-Scale Hierarchical Image Database

    The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories.
  • Places

    The dataset used in the paper is Places, a large dataset of 400k pairs of images from the Places 205 dataset and corresponding spoken audio captions.
  • ResNet-18

    Training neural networks on image datasets generally require extensive experimentation to find the optimal learning rate regime.
  • Imagenet

    The dataset used in the paper is the Imagenet dataset, which is a large annotated dataset of images used for image classification tasks.
  • PASCAL VOC 2007

    Multi-label image recognition is a practical and challenging task compared to single-label image classification.
  • ImageNet Dataset

    Object recognition is arguably the most important problem at the heart of computer vision. Recently, Barbu et al. introduced a dataset called ObjectNet which includes objects in...
  • ImageNet Large Scale Visual Recognition Challenge

    A benchmark for low-shot recognition was proposed by Hariharan & Girshick (2017) and consists of a representation learning phase without access to the low-shot classes and a...
  • FFHQ

    Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...
  • Learning Multiple Layers of Features from Tiny Images

    The CIFAR-10 dataset consists of 60,000 training images and 10,000 test images. Each image is a 32×32 color image.
  • COCO

    Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...