MUSE: Text-to-Image Generation via Masked Generative Transformers

MUSE is a text-to-image generation model that uses masked generative transformers.
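
As a rough illustration of the masked-token idea, the sketch below masks a random subset of discrete image tokens; a masked generative transformer would then be trained to predict the original tokens at those positions, conditioned on the text prompt. This is a toy sketch, not MUSE's actual implementation: `MASK_ID`, the codebook size of 8192, the 16-token sequence, and the cosine masking schedule are all assumptions made for illustration, and no model is included.

```python
import math
import random

MASK_ID = -1  # hypothetical sentinel id for a masked image token


def cosine_mask_ratio(progress: float) -> float:
    """Fraction of tokens to mask at progress in [0, 1] (assumed cosine schedule)."""
    return math.cos(0.5 * math.pi * progress)


def mask_tokens(tokens, ratio, rng):
    """Replace a random subset of tokens with MASK_ID.

    Returns the masked copy and the sorted list of masked positions.
    At least one token is always masked.
    """
    n_mask = max(1, round(ratio * len(tokens)))
    positions = sorted(rng.sample(range(len(tokens)), n_mask))
    masked = list(tokens)
    for p in positions:
        masked[p] = MASK_ID
    return masked, positions


rng = random.Random(0)
# Toy "image": 16 discrete token ids drawn from an assumed codebook of size 8192.
image_tokens = [rng.randrange(8192) for _ in range(16)]
masked, positions = mask_tokens(image_tokens, cosine_mask_ratio(0.25), rng)
```

In a training loop, the positions returned here would be the ones that contribute to the prediction loss; at sampling time, a similar schedule could decide how many tokens remain masked after each parallel decoding step.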

BibTeX: