-
N-object dataset testing
An N-object dataset for testing the proposed framework -
Synthetic-NSVF
The dataset used in the paper SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World -
Synthetic-NeRF
The dataset used in the paper SpikingNeRF: Making Bio-inspired Neural Networks See through the Real World -
Compute trends across three eras of machine learning
A dataset of 650 machine learning models presented in academic publications and relevant gray literature. -
Multiscale Vision Transformers
Multiscale Vision Transformers (MViT) for video and image recognition, by connecting the seminal idea of multiscale feature hierarchies with transformer models. -
DisCo-CLIP: A Distributed Contrastive Loss for Memory Efficient CLIP Training
We propose DisCo-CLIP, a distributed memory-efficient CLIP training approach, to reduce the memory consump- tion of contrastive loss when training contrastive learning models. -
ImageNet Dataset
Object recognition is arguably the most important problem at the heart of computer vision. Recently, Barbu et al. introduced a dataset called ObjectNet which includes objects in... -
LSUN-Church
Progress in GANs has enabled the generation of high-res-olution photorealistic images of astonishing quality. StyleGANs allow for compelling attribute modification on such... -
Benchmark Datasets for Vision Recognition
The dataset used in the paper is a benchmark dataset for vision recognition, consisting of 10 datasets: Tiny ImageNet, Caltech-256, Flowers-102, Food-101, CIFAR-100, CIFAR-10,... -
Scene Flow and KITTI2015 datasets
Two publicly available datasets for training and testing: Scene Flow datasets and KITTI2015 dataset -
INFOBATCH: LOSSLESS TRAINING SPEED UP BY UNBIASED DYNAMIC DATA PRUNING
Data pruning aims to obtain lossless performances with less overall cost. A common approach is to filter out samples that make less contribution to the training. -
Waymo Open Motion Dataset
The Waymo Open Motion Dataset is a large-scale dataset for autonomous driving, containing 104,000 20-second frames of driving scenarios marked at 10 Hz. -
Comparison of 2D vs. 3D U-Net Organ Segmentation in abdominal 3D CT images
80 CT-scans and related label data found in various public sources were considered for training and testing of U-Net architectures. -
Visual AutoRegressive modeling (VAR)
Visual AutoRegressive modeling (VAR), a new generation paradigm that redefines the autoregressive learning on images as coarse-to-fine “next-scale prediction” or... -
NeRF-Synthetic
A point cloud rendering method that achieves comparable rendering performance to NeRF.