-
BoundaryDiffusion
The dataset used in the paper for semantic control and manipulation of images using pre-trained diffusion models. -
Diffusion-based Causal Models
The dataset used in the paper is a synthetic dataset generated with various structural equation types for all three forms of causal queries. -
UniCtrl: Improving the Spatiotemporal Consistency of Text-to-Video Diffusion ...
Video Diffusion Models have been developed for video generation, usually integrating text and image conditioning to enhance control over the generated content. -
MosaicFusion: Diffusion Models as Data Augmenters for Large Vocabulary Instan...
MosaicFusion: A simple yet effective diffusion-based data augmentation approach for large vocabulary instance segmentation. -
CADS: Unleash the diversity of diffusion models through condition-annealed sa...
CADS: Unleash the diversity of diffusion models through condition-annealed sampling. -
Self-Guided Generation of Minority Samples Using Diffusion Models
We present a novel approach for generating minority samples that live on low-density regions of a data manifold. -
SRNDiff: Short-term precipitation nowcasting with condition diffusion model
Diffusion models are widely used in image generation because they can generate high-quality and realistic samples. -
DOLFIN: DIFFUSION LAYOUT TRANSFORMERS WITHOUT AUTOENCODER
A novel generative model, Diffusion Layout Transformers without Autoencoder (Dolfin), which significantly improves the modeling capability with reduced complexity compared to... -
Robustness and Generalizability of Deepfake Detection: A Study with Diffusion...
A robustness and generalizability study of deepfake detection using diffusion models. -
Wavegrad: Estimating gradients for waveform generation
A method for estimating gradients for waveform generation using diffusion models. -
ERNIE-Music: Text-to-Waveform Music Generation with Diffusion Models
A music generation model that leverages free-form text as a conditioning factor, utilizing the diffusion model to generate waveform-based music. -
VecFusion: Vector Font Generation with Diffusion
A new neural architecture for generating vector fonts with varying topological structures and precise control point positions. -
ogbg-molhiv
ogbg-molhiv dataset is a graph classification dataset containing 41k molecules. -
Text-to-Image Diffusion Models
The dataset used for text-to-image diffusion models, including Bluefire, Paintings, 3D, and Origami styles. -
Bora: Biomedical Generalist Video Generation Model
The first spatio-temporal diffusion probabilistic model designed for text-guided biomedical video generation. -
Diffusion Models for Minimally-Supervised Speech Synthesis
Minimally-supervised speech synthesis method based on diffusion models with minimal supervision. Introduces the CTAP method as an intermediate semantic representation and uses... -
Visual Instruction Generation
The dataset used in the paper for visual instruction generation, containing 200 goals and instructions for various tasks. -
Videofactory
The Videofactory dataset, which is used for evaluating the performance of the InstructVid2Vid model.