SceneGenie: Scene Graph Guided Diffusion Models for Image Synthesis

Text-conditioned image generation has made significant progress in recent years with generative adversarial networks and more recently, diffusion models. The proposed guidance approach for the sampling process in the diffusion model that leverages bounding box and segmentation map information at inference time without additional training data.

BibTex: