Dataset - LDM

Multi-Modal CelebA-HQ

A large-scale face image dataset that contains real face images and corresponding semantic segmentation map, sketch, and textual descriptions.
- Dataset
- JSON
T2M-GPT: Generating Human Motion from Textual Descriptions with Discrete Repr...

Generating motion from textual descriptions can be used in numerous applications in the game industry, ﬁlm-making, and animating robots. For example, a typical way to access new...
- Dataset
- JSON
HumanAct12

HumanAct12 dataset is a large-scale 3D human motion dataset with textual descriptions.
- Dataset
- JSON
HumanML3D

HumanML3D is a text-to-motion dataset built upon AMASS dataset and HumanAct12. It provides a wide range of motion-language pairs which cover ordinary activities, such as...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found