-
Argoverse2
Argoverse2 is an open-source evolution of the original Argoverse -
Video Object of Interest Segmentation
A new computer vision task named video object of interest segmentation (VOIS). Given a video and a target image of interest, the objective is to simultaneously segment and track... -
SUN Attribute Dataset
The SUN attribute dataset is a collection of images of scenes. -
Multi-source multi-scale counting in extremely dense crowd images
The UCF CC 50 dataset contains 50 images collected from publicly available web images. -
MNIST, USPS, and CIFAR10
The dataset used in this paper is MNIST, USPS, and CIFAR10. The dataset is used for privacy-preserving CNN training. -
SUN database
SUN database: Large-scale scene recognition from abbey to zoo. -
CARLA simulator datasets for pedestrian detection
Three datasets: training, calibration, and evaluation datasets for pedestrian detection task. -
Automated age-related macular degeneration area estimation – first results
The dataset is used for automatic method for detecting Age-related Macular Degeneration (AMD) lesions in RGB eye fundus images. -
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
MaX-DeepLab: End-to-end panoptic segmentation with mask transformers -
Submanifold Sparse Convolutional Networks
Convolutional network are the de-facto standard for analysing spatio-temporal data such as images, videos, 3D shapes, etc. Whilst some of this data is naturally dense (for... -
BACH 2018 grand challenge
Breast cancer histology images classification using transfer learning -
MS COCO dataset
The MS COCO dataset is a large benchmark for image captioning, containing 328K images with 5 caption descriptions each. -
Vi-Fi Multi-modal Dataset
The dataset used in the paper for layout sequence prediction from noisy mobile modality. -
ImageNet: A Large-Scale Hierarchical Image Database
The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories. -
KITTI Benchmark
A benchmark for stereo matching and depth estimation. -
MoViNets: Mobile Video Networks for Efficient Video Recognition
Mobile Video Networks (MoViNets) is a family of computation and memory efficient video networks that can operate on streaming video for online inference. -
Hyperhuman
Hyperhuman dataset for 3D face rendering