Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

doi:doi:10.57702/b5jlsdyr

Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack

Training text-to-image models with web scale image-text pairs enables the generation of a wide range of visual concepts from text. However, these pre-trained models often face challenges when it comes to generating highly aesthetic images. This creates the need for aesthetic alignment post pre-training. In this paper, we propose quality-tuning to effectively guide a pre-trained model to exclusively generate highly visually appealing images, while maintaining generality across visual concepts.

BibTex:

@dataset{xiaoliangdai_and_jihou_and_cyma_and_sstsai_and_jialiangw_and_ruiw_and_stzpz_2024,
    abstract = {Training text-to-image models with web scale image-text pairs enables the generation of a wide range of visual concepts from text. However, these pre-trained models often face challenges when it comes to generating highly aesthetic images. This creates the need for aesthetic alignment post pre-training. In this paper, we propose quality-tuning to effectively guide a pre-trained model to exclusively generate highly visually appealing images, while maintaining generality across visual concepts.},
    author = {xiaoliangdai and jihou and cyma and sstsai and jialiangw and ruiw and stzpz},
    doi = {10.57702/b5jlsdyr},
    institution = {No Organization},
    keyword = {'aesthetic alignment', 'quality-tuning', 'text-to-image models'},
    month = {dec},
    publisher = {TIB},
    title = {Emu: Enhancing Image Generation Models Using Photogenic Needles in a Haystack},
    url = {https://service.tib.eu/ldmservice/dataset/emu--enhancing-image-generation-models-using-photogenic-needles-in-a-haystack},
    year = {2024}
}