Two/Three-Object Prompts (TwOP/ThreeOP)

Text-to-Image Diffusion Models (T2I DMs) have garnered significant attention for their ability to generate high-quality images from textual descriptions. However, these models often produce images that do not fully align with the input prompts, resulting in semantic inconsistencies. The most prominent issue among these semantic inconsistencies is catastrophic-neglect, where the images generated by T2I DMs miss key objects mentioned in the prompt.

Data and Resources

Cite this as

Zhiyuan Chang, Mingyang Li, Junjie Wang, Yi Liu, Qing Wang, Yang Liu (2024). Dataset: Two/Three-Object Prompts (TwOP/ThreeOP). https://doi.org/10.57702/kmmbqmwo

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2406.16272
Author Zhiyuan Chang
More Authors
Mingyang Li
Junjie Wang
Yi Liu
Qing Wang
Yang Liu