C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model

Co-speech gesture generation is crucial for automatic digital avatar animation. However, existing methods suffer from issues such as unstable training and temporal inconsistency, particularly in generating high-fidelity and comprehensive gestures. Additionally, these methods lack effective control over speaker identity and temporal editing of the generated gestures.

Data and Resources

Cite this as

Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin (2024). Dataset: C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model. https://doi.org/10.57702/6acjrrad

DOI retrieved: December 16, 2024

Additional Info

Field Value
Created December 16, 2024
Last update December 16, 2024
Defined In https://doi.org/10.48550/arXiv.2308.15016
Author Longbin Ji
More Authors
Pengfei Wei
Yi Ren
Jinglin Liu
Chen Zhang
Xiang Yin
Homepage https://c2g2-gesture.github.io/c2 gesture/