-
CIFAR-100 and ImageNet
The paper uses the CIFAR-100 and ImageNet datasets. -
Hand-drawn Symbol Recognition of Surgical Flowsheet Graphs with Deep Image Se...
This dataset is used for hand-drawn symbol recognition of surgical flowsheet graphs with deep image segmentation. -
FULL1 and FULL2 datasets
The FULL1 and FULL2 datasets are subsets of the Oxford RobotCar dataset, with longer route lengths. -
LOOP1 and LOOP2 datasets
The LOOP1 and LOOP2 datasets are subsets of the Oxford RobotCar dataset, with shorter route lengths. -
PhotoBot: Reference-Guided Interactive Photography via Natural Language
PhotoBot is a framework for fully automated photo acquisition based on an interplay between high-level human language guidance and a robot photographer. -
Kubric Dataset
The Kubric dataset offers training and validation data across various difficulty levels, featuring videos of objects interacting with each other. -
Physion Dataset
The Physion dataset provides training, validation, and testing data for seven scenarios, such as dominoes and support, involving rigid-body objects colliding as well as one... -
KITTI-Object
Monocular 3D object localization in driving scenes is a crucial task, but challenging due to its ill-posed nature. Estimating 3D coordinates for each pixel on the object surface... -
Improvised Aerial Object Detection approach for YOLOv3 Using Weighted Luminance
Aerial imaging of ground targets is highly challenging because of various factors that affect light propagation through different mediums. Several convolutional neural... -
ImageNet-1K, COCO, and ADE20K datasets
The paper does not explicitly describe its dataset, but the authors report using the ImageNet-1K, COCO, and ADE20K datasets for image classification,... -
Breast Ultrasound Dataset
The dataset in this paper comes from the database of the Ultrasound Imaging Department of Peking University Shenzhen Hospital. -
Moving MNIST dataset
The Moving MNIST dataset consists of videos of moving MNIST handwritten digits. -
KITTI 2012
KITTI 2012 is a real-world outdoor dataset containing 194 training and 195 testing stereo image pairs of size 376 × 1240. -
Flickr1024
Stereo image super-resolution aims to improve the quality of high-resolution stereo image pairs by exploiting complementary information across views. -
Training CLIP models on Data from Scientific Papers
Contrastive Language-Image Pretraining (CLIP) models are trained with datasets extracted from web crawls, which are of large quantity but limited quality. This paper explores... -
MVSEC dataset
MVSEC is a real-world dataset collected in indoor and outdoor scenarios with sparse optical flow labels.