C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model

doi:10.57702/6acjrrad

Co-speech gesture generation is crucial for automatic digital avatar animation. However, existing methods suffer from issues such as unstable training and temporal inconsistency, particularly in generating high-fidelity and comprehensive gestures. Additionally, these methods lack effective control over speaker identity and temporal editing of the generated gestures.

Data and Resources

Original Metadata JSON
The json representation of the dataset with its distributions based on DCAT.

Cite this as

Longbin Ji, Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin (2024). Dataset: C2G2: Controllable Co-speech Gesture Generation with Latent Diffusion Model. https://doi.org/10.57702/6acjrrad

DOI retrieved: December 16, 2024

Additional Info

Field	Value
Created	December 16, 2024
Last update	December 16, 2024
Defined In	https://doi.org/10.48550/arXiv.2308.15016
Author	Longbin Ji
More Authors	Pengfei Wei, Yi Ren, Jinglin Liu, Chen Zhang, Xiang Yin
Homepage	https://c2g2-gesture.github.io/c2_gesture/