-
Graph Edit Distance
Graph Edit Distance as a quadratic assignment problem. -
Binarized MNIST
We use the preprocessed binarized MNIST dataset from [49] which has a split of 50k/10k/10k. -
MNIST and CIFAR-10 datasets
The MNIST and CIFAR-10 datasets are used to test the theory suggesting the existence of many saddle points in high-dimensional functions. -
ImageNet, ImageNet ReaL, ImageNet V2, etc.
The dataset used in the paper is not explicitly described. However, it is mentioned that the authors used various benchmarks such as ImageNet, ImageNet ReaL, ImageNet V2, etc. -
VideoAttentionTarget
VideoAttentionTarget is a video-based gaze target dataset comprising 71,666 frames from 1,331 clips. -
GazeFollow
GazeFollow is a large-scale dataset consisting of 122,143 images with 130,339 annotations on head-target instances. -
GazeHTA: End-to-end Gaze Target Detection with Head-Target Association
Gaze target detection aims to directly associate individuals and their gaze targets within a single image or across multiple video frames. -
DINOv2: Learning robust visual features without supervision
The authors propose a method for self-supervised representation learning using knowledge distillation and vision transformers. -
Diffusion Classifier
The authors propose a method for zero-shot classification that leverages conditional density estimates from text-to-image diffusion models. -
Diffusion Models Beat GANs on Image Synthesis
Diffusion models have recently emerged as the state-of-the-art of generative modeling, demonstrating remarkable results in image synthesis and across other modalities. -
Diffusion Models and Representation Learning: A Survey
Diffusion Models are popular generative modeling methods in various vision tasks, attracting significant attention. They can be considered a unique instance of self-supervised... -
NuScenes dataset
The dataset used in the paper is the NuScenes dataset, which contains LiDAR point clouds and corresponding semantic annotations. -
ImageNet and MS COCO
The dataset used in the paper is the ImageNet and MS COCO benchmarks. -
Real dataset for fabric mechanics estimation
A real dataset of depth images of fabrics, with mechanical ground truth parameters, used to test and evaluate a method for estimating fabric mechanics from depth images. -
Synthetic dataset for fabric mechanics estimation
A synthetic dataset of depth images of fabrics, with mechanical ground truth parameters, used to train and evaluate a method for estimating fabric mechanics from depth images. -
Simulation Testing Environment
The dataset used for training and testing the direct regression model, sparse correspondence model, and dense correspondence model. -
Learning Eye-in-Hand Camera Calibration from a Single Image
The dataset used for learning-based methods for online eye-in-hand camera calibration from a single RGB image.