-
Factor Decomposed Generative Adversarial Networks for Text-to-Image Synthesis
Text-to-image synthesis using Factor Decomposed Generative Adversarial Networks (FDGAN) -
AttnGAN: Fine-grained text to image generation with attentional generative ad...
AttnGAN: Fine-grained text to image generation with attentional generative adversarial networks -
Recipe1M and CUB datasets
Recipe1M and CUB datasets for text-to-image synthesis -
Pick-a-Pic dataset
The dataset used in the paper is the Pick-a-Pic dataset, which consists of 87,687 pairs of text prompts and images. -
PartiPrompts dataset
The dataset used in the paper is the PartiPrompts dataset, which consists of 851,293 pairs of text prompts and images. -
Text-to-Image Synthesis
The dataset used in the paper is a text-to-image synthesis dataset. -
Ablating Concepts in Text-to-Image Diffusion Models
Large-scale text-to-image diffusion models can generate high-fidelity images with powerful compositional ability. However, these models are typically trained on an enormous... -
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
A dataset of Creative-Commons-licensed images used to train a set of open diffusion models that are qualitatively competitive with Stable Diffusion 2 (SD2). -
AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose
AvatarVerse is a stable pipeline for generating expressive high-quality 3D avatars from text descriptions and pose guidance.