The Sanskrit text is annotated with various NLP tasks, including sentence boundary detection, canonical word ordering, free-form text annotation of tokens, token classification,...
The Objaverse dataset contains around 800k 3D objects. After adopting simple filter leveraging CLIP [27] to remove the objects whose rendered images are not relevant to its...