HumanML3D is a text-to-motion dataset built upon AMASS dataset and HumanAct12. It provides a wide range of motion-language pairs which cover ordinary activities, such as...
The dataset used in this paper is the BABEL dataset, which contains 10881 motion sequences, with 65926 subsequences and the corresponding textual labels.