-
Deep Saliency Prior for Reducing Visual Distraction
A dataset of images used to evaluate the proposed method for reducing visual distraction in images. -
SemanticGAN
SemanticGAN uses a dataset of real images and their corresponding semantic segmentation masks. -
DatasetGAN
DatasetGAN uses a dataset of real images and their corresponding semantic segmentation masks. -
HIVE: Harnessing Human Feedback for Instructional Visual Editing
The dataset used for instructional visual editing in the paper HIVE: Harnessing Human Feedback for Instructional Visual Editing. -
Quantitative Evaluation Dataset
A dataset comprising 3000 images for quantitative evaluation of specified region customization with both text and image inputs. -
Instruct-Video2Avatar
Given a short monocular RGB video and text instructions, the method uses an image-conditioned diffusion model to edit one head image and uses the video stylization method to... -
Region-Aware Diffusion for Zero-Shot Text-Driven Image Editing
The Region-Aware Diffusion for Zero-Shot Text-Driven Image Editing dataset is used to evaluate the ability of models to edit images based on text descriptions. -
Image Multi-Adjustment Dataset (I-MAD-Dense)
The Image Multi-Adjustment Dataset (I-MAD-Dense) contains approximately 22,000 triplets, each containing a source image, an edited image,... -
LoMOE-Bench
LoMOE-Bench is a dataset for multi-object editing, featuring 64 images with 2 to 7 masks, paired with corresponding text prompts. -
Learning to follow image editing instructions
Learning to follow image editing instructions. -
Midjourney dataset
The dataset used in the paper for text-driven image editing, consisting of synthetic images generated by Midjourney. -
COCO-animals-10k
The dataset used in the paper for text-driven image editing, consisting of synthetic images generated by Midjourney and COCO dataset images. -
LSUN Churches
The dataset used for training and testing the Conditionally-Independent Pixel Synthesis (CIPS) generator. -
Flickr-Faces-HQ
Flickr-Faces-HQ contains 70,000 face images at 1024 × 1024 resolution, which were originally crawled from Flickr, manually checked to discard low-quality samples, and then...