-
VSPW: A Large-Scale Dataset for Video Scene Parsing in the Wild
A large-scale dataset for video scene parsing in the wild. -
ImageNet2012
The dataset used in the paper for attention-oriented data analysis and attention-based adversarial defense. -
Video Frame Interpolation with Densely Queried Bilateral Correlation
The Video Frame Interpolation with Densely Queried Bilateral Correlation dataset is a large-scale video frame interpolation dataset, containing 100,000 frames with a resolution... -
Inpaint Anything
The Inpaint Anything dataset is a large-scale image inpainting dataset, containing images with missing regions. -
Microsoft COCO Dataset
The MS COCO 2014 dataset covers 91 object categories and contains 82,783 training images, 40,504 validation images, and 40,775 test images. -
Conceptual Captions 12M
The Conceptual Captions 12M (CC-12M) dataset consists of 12 million diverse and high-quality images paired with descriptive captions and titles. -
SUN database
SUN database: Large-scale scene recognition from abbey to zoo. -
ImageNet: A Large-Scale Hierarchical Image Database
The ImageNet dataset is a large-scale image database that contains over 14 million images, each labeled with one of 21,841 categories. -
Flickr1024
Stereo image super-resolution aims to reconstruct high-resolution stereo image pairs from low-resolution inputs by exploiting complementary information across views. -
TrackingNet
The TrackingNet dataset is a benchmark for visual tracking, containing 511 video sequences with varying difficulties. -
Labeled Faces in the Wild
The dataset is a 4-way array of dimensions 4000 × 90 × 90 × 3, where each pixel gives the intensity for colors red, green and blue, resulting in a multiway array of dimensions X...
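
As a rough illustration of the 4-way array layout described above, the following sketch builds a placeholder array of the stated shape in NumPy and shows how the axes might be indexed. The variable names, the `uint8` dtype, the zero-filled data, and the axis order (image, row, column, color channel) are assumptions for illustration only, not details taken from the dataset itself.

```python
import numpy as np

# Assumed axis order: (image, row, column, color channel).
n_images, height, width, channels = 4000, 90, 90, 3

# Hypothetical stand-in: zeros instead of real pixel intensities.
faces = np.zeros((n_images, height, width, channels), dtype=np.uint8)

# A single image is a 90 x 90 x 3 slice along the first axis.
first_image = faces[0]

# The red intensities of every image (channel index 0, assuming RGB order).
red_channel = faces[..., 0]

# Flattening each image into one row gives a 2-way (matrix) view.
flattened = faces.reshape(n_images, -1)

print(faces.ndim, first_image.shape, red_channel.shape, flattened.shape)
```

Keeping the data as a 4-way array preserves the spatial and color structure of each image, whereas the flattened view discards it in exchange for a plain matrix.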