-
Scaling autoregressive models for content-rich text-to-image generation
Scaling autoregressive models for content-rich text-to-image generation. -
Two/Three-Object Prompts (TwOP/ThreeOP)
Text-to-Image Diffusion Models (T2I DMs) have garnered significant attention for their ability to generate high-quality images from textual descriptions. However, these models... -
Template-Based Pairs (TBP)
Text-to-Image Diffusion Models (T2I DMs) have garnered significant attention for their ability to generate high-quality images from textual descriptions. However, these models... -
MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion
The dataset is used to evaluate the proposed MuLan framework for progressive multi-object diffusion. It contains 200 prompts with complex spatial relationships and attribute... -
ENTIGEN: Evaluating the Effect of Ethical Interventions on Text-to-Image Gene...
The ENTIGEN dataset is a benchmark for evaluating the change in the diversity of text-to-image generations in the presence of ethical interventions. -
IP-adapter
A dataset of pre-trained models and their corresponding text prompts used for text-to-image diffusion models. -
Blip-diffusion
A dataset of pre-trained models and their corresponding text prompts used for text-to-image generation and editing. -
T2I-Compbench-Count
A benchmark for open-world compositional text-to-image generation. The dataset consists of 218 prompts with a single object and its number. -
Diffusion idea exploration for art generation
The dataset used for the text-to-image generation task using diffusion models. -
LAION-Improved-Aesthetics (v1.2)
The LAION-Improved-Aesthetics (v1.2) dataset used for training the Stable Diffusion model, which includes images with captions. -
Stable Rivers
The dataset used for training the Stable Diffusion model, which includes images with captions containing various terminology relevant to the field of fluvial geomorphology. -
Stable Diffusion Prompts
The dataset used in the paper for text-to-image generation and style transfer tasks. -
Break-A-Scene: Extracting Multiple Concepts from a Single Image
The dataset is created by augmenting a single input image with masks that indicate the presence of target concepts. The masks can be provided by the user or generated... -
Elite dataset
The Elite dataset contains images with visual concepts encoded into textual embeddings. -
TI and DreamBooth dataset
The dataset used in this paper is a combined dataset of the TI dataset of 5 concepts, and the dataset from DreamBooth with 20 concepts.