-
CIFAR-10, CIFAR-100, and MNIST
The dataset used in the paper is a benchmark dataset for diffusion models, specifically denoising diffusion probabilistic models (DDPM). The dataset consists of images from... -
Envision3D: One Image to 3D
Envision3D generates 32 dense view images and extracts high-quality 3D content from one input image in 3-4 minutes. -
Diffusion-generated Deepfake Detection dataset (D3)
This dataset contains images generated by various text-to-image models, including Stable Diffusion 1.4, Stable Diffusion 2.1, Stable Diffusion XL, and DeepFloyd IF. -
Robust CLIP-Based Detector for Exposing Diffusion Model-Generated Images
Diffusion models (DMs) have revolutionized image generation, producing high-quality images with applications spanning various fields. However, their ability to create... -
Towards performant and reliable undersampled MR reconstruction via diffusion ...
Towards performant and reliable undersampled MR reconstruction via diffusion model sampling. -
High-frequency space diffusion models for accelerated MRI
High-frequency space diffusion models for accelerated MRI. -
Self-Supervised MRI Reconstruction with Unrolled Diffusion Models
Magnetic Resonance Imaging (MRI) produces excellent soft tissue contrast, albeit it is an inherently slow imaging modality. Promis- ing deep learning methods have recently been... -
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
A dataset of Creative-Commons-licensed images, which is used to train a set of open diffusion models that are qualitatively competitive with Stable Diffusion 2 (SD2). -
Extended MusicCaps
Extended MusicCaps is a music caption dataset that is extended to include images. -
MELFUSION: Synthesizing Music from Image and Language Cues using Diffusion Mo...
MELFUSION is a text-to-music diffusion model that can synthesize music conditioned on both visual and textual modality. -
Particle Denoising Diffusion Sampler
Denoising diffusion models have become ubiquitous for generative modeling. The core idea is to transport the data distribution to a Gaussian by using a diffusion. Approximate... -
AvatarVerse: High-quality & Stable 3D Avatar Creation from Text and Pose
AvatarVerse is a stable pipeline for generating expressive high-quality 3D avatars from text descriptions and pose guidance. -
Diffusion-based speech enhancement with a weighted generative-supervised lear...
Diffusion-based speech enhancement with a weighted generative-supervised learning loss -
Pixel-Aware Stable Diffusion for Realistic Image Super-Resolution and Persona...
Realistic image super-resolution (Real-ISR) and image stylization -
DiffusionDB
A large database of 2 million images, which can also be downloaded and used as open source.