6 datasets found

Groups: Text-to-Image Generation

Filter Results
  • SDXL

    The dataset used for training the diffusion model, containing 2M images.
  • Break-A-Scene: Extracting Multiple Concepts from a Single Image

    The dataset is created by augmenting a single input image with masks that indicate the presence of target concepts. The masks can be provided by the user or generated...
  • LAION-2B

    The dataset used in the paper is LAION-2B, which is a large-scale image-text dataset. The authors fine-tune a pre-trained diffusion model with a subset of LAION-2B with 10k...
  • Localized Narratives-COCO-5K

    The dataset used for training and evaluation of the W¨urstchen model.
  • COCO-30K

    The dataset used for training and evaluation of the W¨urstchen model.
  • COCO

    Large scale datasets [18, 17, 27, 6] boosted text conditional image generation quality. However, in some domains it could be difficult to make such datasets and usually it could...