You're currently viewing an old version of this dataset. To see the current version, click here.

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

Training text-to-image models with web scale image-text pairs enables the generation of a wide range of visual concepts from text. However, these pre-trained models often face challenges when it comes to generating highly aesthetic images. This creates the need for aesthetic alignment post pre-training. In this paper, we propose quality-tuning to effectively guide a pre-trained model to exclusively generate highly visually appealing images, while maintaining generality across visual concepts.

Data and Resources

This dataset has no data

Cite this as

xiaoliangdai, jihou, cyma, sstsai, jialiangw, ruiw, stzpz (2024). Dataset: Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack. https://doi.org/10.57702/b5jlsdyr

Private DOI This DOI is not yet resolvable.
It is available for use in manuscripts, and will be published when the Dataset is made public.

Additional Info

Field Value
Created December 2, 2024
Last update December 2, 2024
Author xiaoliangdai
More Authors
jihou
cyma
sstsai
jialiangw
ruiw
stzpz