-
Visual Instruction Generation
The dataset used in the paper for visual instruction generation, containing 200 goals and instructions for various tasks. -
New Dataset for Zero-Shot Performance Comparison of Spatial Conditions
The dataset used in the paper is a new dataset for zero-shot performance comparison of spatial conditions. -
Multiple Subjects Generation
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a large-scale text-to-image model to generate images with multiple subjects. -
Paired Customization
A dataset of paired style and content images used for customizing a pre-trained text-to-image model with a single image pair. -
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis
MARS is an innovative auto-regressive framework that not only retains the capabilities of pre-trained Large Language Models (LLMs) but also incorporates top-tier text-to-image... -
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation
A set of metrics for evaluating text-to-image synthesis. -
Concept Conjunction 500 (CC-500)
The Concept Conjunction 500 (CC-500) dataset is a benchmark for text-to-image synthesis, consisting of 500 images with 500 corresponding text descriptions. -
Attribute Binding Contrast (ABC-6K)
The Attribute Binding Contrast (ABC-6K) dataset is a benchmark for text-to-image synthesis, consisting of 6,000 images with 6,000 corresponding text descriptions. -
DreamStone: Image as a Stepping Stone for Text-Guided 3D Shape Generation
Text-guided 3D shape generation approach using CLIP and pre-trained single-view reconstruction model -
Laion-400M
Text-to-image Latent Diffusion model, CLIP model, Blended Diffusion model, GLIDE model, GLIDE-filtered model -
Pixart Alpha
Diffusion Models (DMs) represent a powerful class of generative models that have gained significant attention in recent years. -
Concept Sliders Test Dataset
The dataset used for testing the Concept Sliders, consisting of paired image data and text prompts. -
Concept Sliders Dataset
The dataset used for training the Concept Sliders, consisting of paired image data and text prompts. -
Photorealistic text-to-image diffusion models with deep language understanding
The authors present a photorealistic text-to-image diffusion model with deep language understanding. -
Coarse-Fine Granularity Prompts dataset (CFP)
Coarse-Fine Granularity Prompts dataset (CFP) is a collection of 81,910 data instances from popular text-to-image community.