Computer Vision - Groups

DeepViT

The DeepViT dataset is a dataset for vision.

Dataset
JSON

AS-MLP: Axial Shifted MLP Architecture for Vision

An Axial Shifted MLP architecture (AS-MLP) is proposed for vision. Different from MLP-Mixer, where the global spatial feature is encoded for information flow through matrix...

Dataset
JSON

User-Controllable Latent Transformer for StyleGAN Image Layout Editing

The dataset used in the paper for user-controllable latent code transformation for StyleGAN image layout editing.

Dataset
JSON

Paciﬁc Graphics 2022

The dataset used in the paper for user-controllable latent code transformation for StyleGAN image layout editing.

Dataset
JSON

Going Deeper with Convolutions

The dataset used for training and testing the proposed method.

Dataset
JSON

DenseNet

The DenseNet dataset is a dataset for image recognition on handwritten digits composed of 60,000 training data and 10,000 test images.

Dataset
JSON

ChestX-ray8

A hospital-scale chest X-ray database, namely “ChestX-ray8”, which comprises 108,948 frontal-view X-ray images of 32,717 unique patients with the text-mined eight common disease...

Dataset
JSON

Learning compact representations for LiDAR completion and generation

Learning compact representations for LiDAR completion and generation.

Dataset
JSON

LidarDM: Generative LiDAR Simulation in a Generated World

LidarDM: A novel layout-conditioned latent diffusion model for generating realistic LiDAR point clouds.

Dataset
JSON

Occ3D-nuScenes

Occupancy prediction plays a pivotal role in au-

Dataset
JSON

Graph Matching

Graph matching (GM) constitutes a pervasive problem in computer vision and pattern recognition, with applications in image registration, recognition, stereo, 3D shape matching,...

Dataset
JSON

Contextual Convolution

Contextual convolution (CoConv) for visual recognition. CoConv is a direct replacement of the standard convolution that can be used at any stage in CNN architectures.

Dataset
JSON

EuRoC MAV Dataset

A dataset for visual odometry, containing 11 sequences with provided ground-truth poses.

Dataset
JSON

Flying Chairs

The dataset used in the paper is not explicitly described, but it is mentioned that the authors applied their approach to the challenging problem of optical flow estimation and...

Dataset
JSON

CNN Model Dataset

The dataset used in this paper is a dataset of four CNN models: ResNet-18, Vgg-16, Squeezenet v1.0, and AlexNet.

Dataset
JSON

Acoustic AVSpeech

The Acoustic AVSpeech dataset is a benchmark for visual acoustic matching.

Dataset
JSON

SoundSpaces-Speech

The SoundSpaces-Speech dataset is a benchmark for visual acoustic matching.

Dataset
JSON

Oxford Radar RobotCar dataset

The Oxford Radar RobotCar dataset is a radar extension to the Oxford RobotCar dataset, containing radar observations of a vehicle in various scenarios.

Dataset
JSON

ChainerCV

ChainerCV supports algorithms to solve tasks in the computer vision field such as object detection, while considering usability and predictable performance as the top priorities.

Dataset
JSON

VGGHeads: A Large-Scale Synthetic Dataset for 3D Human Heads

A large-scale synthetic dataset for human head detection and 3D mesh estimation.

Dataset
JSON

992 datasets found