-
New Dataset for Zero-Shot Performance Comparison of Spatial Conditions
A new dataset for comparing zero-shot performance across spatial conditions. -
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Text-to-image synthesis using Factor Decomposed Generative Adversarial Networks (FDGAN) -
LDEdit: Towards Generalized Text Guided Image Manipulation via Latent Diffusi...
Open-domain image manipulation using arbitrary text prompts -
Multiple Subjects Generation
The paper does not explicitly describe the dataset; the authors report using a large-scale text-to-image model to generate images with multiple subjects. -
Paired Customization
A dataset of paired style and content images used for customizing a pre-trained text-to-image model with a single image pair. -
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
MARS is an innovative auto-regressive framework that not only retains the capabilities of pre-trained Large Language Models (LLMs) but also incorporates top-tier text-to-image... -
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation
A set of metrics for evaluating text-to-image synthesis. -
AttnGAN: Fine-grained text to image generation with attentional generative ad...
AttnGAN: Fine-grained text to image generation with attentional generative adversarial networks -
Recipe1M and CUB datasets
Recipe1M and CUB datasets for text-to-image synthesis -
Concept Conjunction 500 (CC-500)
The Concept Conjunction 500 (CC-500) dataset is a benchmark for text-to-image synthesis, consisting of 500 text prompts that each conjoin two concepts (for example, two objects with different color attributes). -
Attribute Binding Contrast (ABC-6K)
The Attribute Binding Contrast (ABC-6K) dataset is a benchmark for text-to-image synthesis, consisting of roughly 6,000 text prompts arranged as contrastive pairs in which the attribute words are swapped between objects. -
DreamStone: Image as a Stepping Stone for Text-Guided 3D Shape Generation
A text-guided 3D shape generation approach that uses CLIP and a pre-trained single-view reconstruction model. -
Balance Swap-Sampling for Creative Text Pair-to-Object Generation
Generating creative combinatorial objects by combining seemingly unrelated object concepts. -
Laion-400M
Text-to-image Latent Diffusion model, CLIP model, Blended Diffusion model, GLIDE model, GLIDE-filtered model -
Pick-a-Pic dataset
The Pick-a-Pic dataset, which consists of 87,687 pairs of text prompts and images. -
PartiPrompts dataset
The PartiPrompts dataset, a benchmark set of over 1,600 English text prompts for evaluating text-to-image models. -
Adding conditional control to text-to-image diffusion models
Adding conditional control to text-to-image diffusion models. -
Learning to follow image editing instructions
Learning to follow image editing instructions.