320 datasets found

Tags: Computer Vision

Filter Results
  • Nerf++

    The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.
  • NeRF-RL

    The dataset used in the paper is a collection of images of objects in 3D space, with multiple views of each object.
  • MS COCO dataset

    The MS COCO dataset is a large benchmark for image captioning, containing 328K images with 5 caption descriptions each.
  • ImageNet: A Large-Scale Hierarchical Image Database

    The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories.
  • Hyperhuman

    Hyperhuman dataset for 3D face rendering
  • ImageNet21K

    The ImageNet21K dataset is used for training and evaluation of the proposed Circulant Channel-Specific (CCS) token-mixing MLP.
  • MS COCO 2017

    The dataset used in this paper is a collection of frames for video coding, with different Quantisation Parameters (QPs) and frame types.
  • Omniglot dataset

    The Omniglot dataset consists of 100 classes, each containing 20 images. Ten images were taken from each class for augmentation, and the rest were used as the test set. Each...
  • Blender Dataset

    The Blender dataset consists of 8 synthetic 3D scenes, each with a hundred posed images of resolution 800 × 800.
  • TinyImagenet dataset

    The dataset used in the paper is not explicitly described, but it is mentioned that the authors used TinyImagenet dataset for pre-training the embedding functions.
  • CIFAR-10, CIFAR-100, and STL-10 datasets

    The dataset used in the paper is not explicitly described, but it is mentioned that the authors used CIFAR-10, CIFAR-100, and STL-10 datasets for training and testing the...
  • Surface Networks

    The dataset used in the paper is a 3D mesh dataset, which is used for training and testing the Surface Networks model.
  • Neural 3D Mesh Renderer

    The dataset used in the paper Neural 3D Mesh Renderer. The dataset consists of 3D models of objects.
  • SceneFlow

    Large dataset for stereo matching, optical flow, and scene flow estimation
  • Human Action Recognition

    The Human Action Recognition dataset is used for human action recognition tasks.
  • Places

    The dataset used in the paper is Places, a large dataset of 400k pairs of images from the Places 205 dataset and corresponding spoken audio captions.
  • MVG

    The MVG dataset, which contains 1,009 samples, each with five images of the same person wearing the same garment from five different views.
  • MV-VTON: Multi-View Virtual Try-On with Diffusion Models

    The proposed method for Multi-View Virtual Try-On (MV-VTON) task, which aims at using the frontal and back clothing to reconstruct the dressing results of a person from multiple...
  • SPair-71k

    The proposed method, dubbed Dynamic Hyperpixel Flow, learns to compose hypercolumn features on the fly by selecting a small number of relevant layers from a deep convolutional...
  • ResNet-50

    The dataset used in the paper is the ResNet-50 dataset, a convolutional neural network model.
You can also access this registry using the API (see API Docs).