-
Kodak PhotoCD dataset
The dataset used in the paper is the Kodak PhotoCD dataset, which contains twenty-four 768 × 512 images.
-
OpenImages
Large-scale vision-and-language models trained on curated and web-scraped data have led to significant improvements over task-specific models when transferred to downstream...