Dataset - LDM

Pascal VOC

Semantic segmentation is a crucial and challenging task for image understanding. It aims to predict a dense labeling map for the input image, which assigns each pixel a unique...
- Dataset
- JSON
YCB-Video

The dataset used for 6D object pose estimation, consisting of images of 16 objects with varying levels of occlusion.
- Dataset
- JSON
Objaverse

The Objaverse dataset contains around 800k 3D objects. After adopting simple filter leveraging CLIP [27] to remove the objects whose rendered images are not relevant to its...
- Dataset
- JSON
Various Datasets

The datasets used in the paper are described as follows: WikiMIA, BookMIA, Temporal Wiki, Temporal arXiv, ArXiv-1 month, Multi-Webdata, LAION-MI, Gutenberg.
- Dataset
- JSON
Inception Transformer

Recent studies show that Transformer has strong capability of building long-range dependencies, yet is incompetent in capturing high frequencies that predomi- nantly convey...
- Dataset
- JSON
MOCA: Masked Online Codebook Assignments prediction

Self-supervised representation learning for Vision Transformers (ViT) to mitigate the greedy needs of ViT networks for very large fully-annotated datasets.
- Dataset
- JSON
ShapeNetPart

The dataset used in the paper is ShapeNetPart, a synthetic dataset for 3D object part segmentation. It contains 16,881 models from 16 categories.
- Dataset
- JSON
DTU MVS

Large scale multi-view stereopsis evaluation dataset.
- Dataset
- JSON
PCB Component Detection using Computer Vision for Hardware Assurance

The dataset used in this study is a semantic PCB image dataset, which contains images of PCBs with annotated components.
- Dataset
- JSON
CIFAR10 and CIFAR100

The dataset used in the paper is not explicitly described, but it is mentioned that the authors conducted experiments on various vision tasks, including image classification,...
- Dataset
- JSON
N-object dataset testing

An N-object dataset for testing the proposed framework
- Dataset
- JSON
Synthetic-NSVF

The dataset used in the paper SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World
- Dataset
- JSON
Synthetic-NeRF

The dataset used in the paper SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World
- Dataset
- JSON
Compute trends across three eras of machine learning

A dataset of 650 machine learning models presented in academic publications and relevant gray literature.
- Dataset
- JSON
Multiscale Vision Transformers

Multiscale Vision Transformers (MViT) for video and image recognition, by connecting the seminal idea of multiscale feature hierarchies with transformer models.
- Dataset
- JSON
CLEVR

CLEVR images contain objects characterized by a set of attributes (shape, color, size and material). The questions are grouped into 5 categories: Exist, Count, CompareInteger,...
- Dataset
- JSON
SUN RGB-D

RGB-D scene recognition approaches often train two standalone backbones for RGB and depth modalities with the same Places or ImageNet pre-training. However, the pre-trained...
- Dataset
- JSON
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efﬁcient CLIP Training

We propose DisCo-CLIP, a distributed memory-efﬁcient CLIP training approach, to reduce the memory consump- tion of contrastive loss when training contrastive learning models.
- Dataset
- JSON
ImageNet Dataset

Object recognition is arguably the most important problem at the heart of computer vision. Recently, Barbu et al. introduced a dataset called ObjectNet which includes objects in...
- Dataset
- JSON
LSUN-Church

Progress in GANs has enabled the generation of high-res-olution photorealistic images of astonishing quality. StyleGANs allow for compelling attribute modification on such...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

992 datasets found