1 dataset found

Tags: CLIP-Guided Generative Latent Space Search

Filter Results
  • CLIP-GLaSS

    The dataset used for the text-to-image task consists of 20 context tokens, to which three fixed tokens have been concatenated, representing the static context "the picture of".
You can also access this registry using the API (see API Docs).