Text-to-Image Generation - Groups

SDXL

The dataset used for training the diffusion model, containing 2M images.

Dataset
JSON

Break-A-Scene: Extracting Multiple Concepts from a Single Image

The dataset is created by augmenting a single input image with masks that indicate the presence of target concepts. The masks can be provided by the user or generated...

Dataset
JSON

LAION-2B

The dataset used in the paper is LAION-2B, which is a large-scale image-text dataset. The authors fine-tune a pre-trained diffusion model with a subset of LAION-2B with 10k...

Dataset
JSON

Localized Narratives-COCO-5K

The dataset used for training and evaluation of the W¨urstchen model.

Dataset
JSON

COCO-30K