-
FFHQ, AFHQ-Cat, and LSUN-Church
The dataset used in the paper is a large dataset of images, including FFHQ, AFHQ-Cat, and LSUN-Church. -
FLIR, MFNet, COME15K, MCXFace
Cross-modal datasets with text descriptions for cross-modal image generation under various layout conditions. -
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
VisionLLaMA is a unified and generic modeling framework for solving most vision tasks. -
Portrait Diffusion: Training-free Face Stylization with Chain-of-Painting
Face stylization refers to the transformation of a face into a specific portrait style. However, current methods require the use of example-based adaptation approaches to... -
Concept Sliders Test Dataset
The dataset used for testing the Concept Sliders, consisting of paired image data and text prompts. -
Concept Sliders Dataset
The dataset used for training the Concept Sliders, consisting of paired image data and text prompts. -
GIU-GANs:Global Information Utilization for Generative Adversarial Networks
Recently, with the rapid development of artificial intelligence, image generation based on deep learning has advanced significantly. Image generation based on Generative... -
RoentGen: Vision-Language Foundation Model for Chest X-ray Generation
Multimodal models trained on large natural image-text pair datasets have exhibited astounding abilities in gener-ating high-quality images. Medical imaging data is fundamentally... -
Human Pose Transfer by Adaptive Hierarchical Deformation
Human pose transfer, as a misaligned image generation task, is very challenging. Existing methods cannot effectively utilize the input information, which often fail to preserve... -
FIT: Far-reaching Interleaved Transformers
We present FIT: a transformer-based architecture with efficient self-attention and adaptive computation. -
ControlNet dataset
ControlNet dataset for image generation -
DiffuGen: Adaptable Approach for Generating Labeled Image Datasets using Stab...
DiffuGen generates high-quality labeled image datasets using stable diffusion models. -
Expanding small-scale datasets with guided imagination
The dataset is used for image generation and editing tasks. -
Labeled Faces in the Wild
The dataset is a 4-way array of dimensions 4000 × 90 × 90 × 3, where each pixel gives the intensity for colors red, green and blue, resulting in a multiway array of dimensions X... -
NASA-space model
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a dataset to train and fine-tune the diffusion model. -
Stable Diffusion v1.4
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a dataset to train and fine-tune the diffusion model. -
Pokémon-LoRA model
The dataset used in the paper is not explicitly described, but it is mentioned that the authors used a dataset to train and fine-tune the diffusion model.