CLIP-Mesh: Generating Textured Meshes from Text Using Pretrained Image-Text Models

The paper does not explicitly describe a dataset. It states only that the authors used a pretrained image-text model.

BibTeX: