NTU-RGB+D 120

Skeleton-based action recognition has attracted considerable attention due to its compact representation of the human body’s skeletal structure. Many recent methods have achieved remarkable performance using graph convoluitional networks (GCNs) and convolutional neural networks (CNNs), which extract spatial and temporal features, respectively. Although spatial and temporal dependencies in the human skeleton have been explored separately, spatio-temporal dependency is rarely considered.

Data and Resources

Cite this as

Jungho Lee, Minhyeok Lee, Suhwan Cho, Sungmin Woo, Sungjun Jang, Sangyoun Lee (2024). Dataset: NTU-RGB+D 120. https://doi.org/10.57702/mljyffpb

DOI retrieved: December 3, 2024

Additional Info

Field Value
Created December 3, 2024
Last update December 3, 2024
Defined In https://doi.org/10.48550/arXiv.2212.04761
Author Jungho Lee
More Authors
Minhyeok Lee
Suhwan Cho
Sungmin Woo
Sungjun Jang
Sangyoun Lee
Homepage https://github.com/Jho-Yonsei/STC-Net