-
CRD-CGAN: Category-Consistent and Relativistic Constraints for Diverse Text-t...
Generating photo-realistic images from a text description is a challenging problem in computer vision. Previ- ous works have shown promising performance to generate synthetic... -
Scaling autoregressive models for content-rich text-to-image generation
Scaling autoregressive models for content-rich text-to-image generation. -
Stable Diffusion v1-5
The dataset used in this paper is a text-to-image diffusion model, specifically Stable Diffusion v1-5. -
Text-to-Image Diffusion Models
The dataset used for text-to-image diffusion models, including Bluefire, Paintings, 3D, and Origami styles. -
HPSv2 dataset
The HPSv2 dataset is a text-image pair dataset containing 3200 prompts and their corresponding images. -
A-SDM: Accelerating Stable Diffusion through Model Assembly and Feature Inher...
The Stable Diffusion Model (SDM) is a preva- lent and effective model for text-to-image (T2I) and image-to-image (I2I) generation. -
Two/Three-Object Prompts (TwOP/ThreeOP)
Text-to-Image Diffusion Models (T2I DMs) have garnered significant attention for their ability to generate high-quality images from textual descriptions. However, these models... -
Template-Based Pairs (TBP)
Text-to-Image Diffusion Models (T2I DMs) have garnered significant attention for their ability to generate high-quality images from textual descriptions. However, these models... -
MuLan: Multimodal-LLM Agent for Progressive Multi-Object Diffusion
The dataset is used to evaluate the proposed MuLan framework for progressive multi-object diffusion. It contains 200 prompts with complex spatial relationships and attribute... -
ENTIGEN: Evaluating the Effect of Ethical Interventions on Text-to-Image Gene...
The ENTIGEN dataset is a benchmark for evaluating the change in the diversity of text-to-image generations in the presence of ethical interventions. -
IP-adapter
A dataset of pre-trained models and their corresponding text prompts used for text-to-image diffusion models. -
Blip-diffusion
A dataset of pre-trained models and their corresponding text prompts used for text-to-image generation and editing. -
T2I-Compbench-Count
A benchmark for open-world compositional text-to-image generation. The dataset consists of 218 prompts with a single object and its number. -
Custom Diffusion
The dataset used in this paper is a large-scale text-to-image diffusion model, which consists of 35 subjects with unique pets and objects. -
Continuous 3D Words
A dataset of images with fine-grained control over several 3D-aware attributes, including time-of-day illumination, bird wing orientation, dollyzoom effect, and object poses.