-
Davis and WebVid datasets
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used 26 text-video pairs from the public DAVIS and WebVid datasets. -
COCO Captions
Object detection is a fundamental task in computer vision, requiring large annotated datasets that are difficult to collect. -
GEMRec-18K
The GEMRec-18K dataset contains 18 thousand images generated by 200 text-to-image diffusion models fine-tuned on Stable Diffusion. -
ART•V: Auto-Regressive Text-to-Video Generation with Diffusion Models
ART•V is an efficient framework for auto-regressive video generation with diffusion models. It generates a single frame at a time, conditioned on the previous ones. -
Non-linear Correction for Diffusion Model at Large Guidance Scale
The dataset used in the paper is a large-scale image generation dataset, which is used to evaluate the performance of the characteristic guidance method. -
UniControl: A Unified Diffusion Model for Controllable Visual Generation In t...
UniControl is a unified diffusion model for controllable visual generation in the wild, which is capable of simultaneously handling various visual conditions for the... -
Beta Diffusion
Beta diffusion is a novel generative modeling method that integrates demasking and denoising to generate data within bounded ranges. -
Training Data Attribution for Diffusion Models
Diffusion models have become increasingly popular for synthesizing high-quality samples based on training datasets. However, given the oftentimes enormous sizes of the training... -
DiffusionDB
A large database of 2 million images, which can also be downloaded and used as open source. -
T2I-CompBench
The dataset used in the paper is not explicitly mentioned, but it is implied to be a text-to-image dataset. -
DiffusionDB: A Large-Scale Prompt Gallery Dataset for Text-to-Image Generativ...
DiffusionDB: A large-scale prompt gallery dataset for text-to-image generative models. -
A CHEAPER AND BETTER DIFFUSION LANGUAGE MODEL WITH SOFT-MASKED NOISE
Diffusion models that are based on iterative denoising have been recently proposed and leveraged in various generation tasks like image generation.