-
LatteGAN: Visually Guided Language Attention for Multi-Turn Text-Conditioned ...
Text-guided image manipulation tasks have recently gained attention in the vision-and-language community. The GeNeVA task is a multi-turn text-conditioned image generation...