Automated age-related macular degeneration area estimation – first results
The dataset is used for an automatic method of detecting Age-related Macular Degeneration (AMD) lesions in RGB eye fundus images. -
MaX-DeepLab: End-to-End Panoptic Segmentation with Mask Transformers
MaX-DeepLab: End-to-end panoptic segmentation with mask transformers -
Submanifold Sparse Convolutional Networks
Convolutional networks are the de-facto standard for analysing spatio-temporal data such as images, videos, 3D shapes, etc. Whilst some of this data is naturally dense (for... -
BACH 2018 grand challenge
Breast cancer histology images classification using transfer learning -
MS COCO dataset
The MS COCO dataset is a large benchmark for image captioning, containing 328K images with 5 caption descriptions each. -
Vi-Fi Multi-modal Dataset
The dataset used in the paper for layout sequence prediction from noisy mobile modality. -
ImageNet: A Large-Scale Hierarchical Image Database
The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories. -
KITTI Benchmark
A benchmark for stereo matching and depth estimation. -
MoViNets: Mobile Video Networks for Efficient Video Recognition
Mobile Video Networks (MoViNets) is a family of computation and memory efficient video networks that can operate on streaming video for online inference. -
Hyperhuman dataset for 3D face rendering -
MonoDiffusion: Self-Supervised Monocular Depth Estimation Using Diffusion Model
MonoDiffusion: A novel self-supervised monocular depth estimation framework by reformulating it as an iterative denoising process. -
InsPLAD: A Dataset and Benchmark for Power Line Asset Inspection in UAV Images
Power line maintenance and inspection are essential to avoid power supply interruptions, reducing its high social and financial impacts yearly. Automating power line visual... -
METER: a mobile vision transformer architecture for monocular depth estimation
Monocular depth estimation is a fundamental capability for autonomous systems that need to assess their own state and perceive the surrounding environment. -
Omniglot dataset
The Omniglot dataset consists of 100 classes, each containing 20 images. Ten images were taken from each class for augmentation, and the rest were used as the test set. Each... -
ImageNet with CMA-Search
A dataset of ImageNet images with subtle 3D perspective changes that can break ImageNet-trained classification networks. -
Controlled Rendered Data of Real World Objects
A dataset of complex image data with a fixed, known distribution, generated using a computer graphics pipeline. -
Blender Dataset
The Blender dataset consists of 8 synthetic 3D scenes, each with a hundred posed images of resolution 800 × 800.