Dataset - LDM

Internal Text-Image Dataset

The dataset used in the paper for subject-driven text-to-image synthesis
WebLI

The dataset used in the paper for subject-driven text-to-image synthesis
Paint by Word

The Paint by Word dataset, for a human-AI collaborative task that enables a user to edit a generated image by painting a semantic modification specified by any text description, to any...
Visual Instruction Generation

The dataset used in the paper for visual instruction generation, containing 200 goals and instructions for various tasks.
New Dataset for Zero-Shot Performance Comparison of Spatial Conditions

A new dataset, introduced in the paper, for zero-shot performance comparison of spatial conditions.
EPViT

Evaluation of image-text alignment for text-to-image synthesis models
AE-276

Evaluation of image-text alignment for text-to-image synthesis models
CC-500

Evaluation of image-text alignment for text-to-image synthesis models
DAA-200

Evaluation of image-text alignment for text-to-image synthesis models
Multiple Subjects Generation

The paper does not explicitly describe the dataset; it mentions only that the authors used a large-scale text-to-image model to generate images with multiple subjects.
Paired Customization

A dataset of paired style and content images used for customizing a pre-trained text-to-image model with a single image pair.
MARS: Mixture of Auto-Regressive Models for Fine-grained Text-to-image Synthesis

MARS is an innovative auto-regressive framework that not only retains the capabilities of pre-trained Large Language Models (LLMs) but also incorporates top-tier text-to-image...
TISE: Bag of Metrics for Text-to-Image Synthesis Evaluation

A set of metrics for evaluating text-to-image synthesis.
Concept Conjunction 500 (CC-500)

The Concept Conjunction 500 (CC-500) dataset is a benchmark for text-to-image synthesis, consisting of 500 images with 500 corresponding text descriptions.
Attribute Binding Contrast (ABC-6K)

The Attribute Binding Contrast (ABC-6K) dataset is a benchmark for text-to-image synthesis, consisting of 6,000 images with 6,000 corresponding text descriptions.
DreamStone: Image as a Stepping Stone for Text-Guided 3D Shape Generation

A text-guided 3D shape generation approach using CLIP and a pre-trained single-view reconstruction model.
LAION-400M

Text-to-image Latent Diffusion model, CLIP model, Blended Diffusion model, GLIDE model, GLIDE-filtered model
Pixart Alpha

Diffusion Models (DMs) represent a powerful class of generative models that have gained significant attention in recent years.
Concept Sliders Test Dataset

The dataset used for testing the Concept Sliders, consisting of paired image data and text prompts.
Concept Sliders Dataset

The dataset used for training the Concept Sliders, consisting of paired image data and text prompts.

You can also access this registry using the API (see API Docs).