-
FashionEngine: Interactive 3D Human Generation and Editing via Multimodal Con...
FashionEngine is an interactive 3D human generation and editing system that enables easy and efficient production of 3D digital humans with multimodal control. -
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Me...
A text-driven animated human mesh synthesis system that leverages multi-modal aware and semantic textual matching. -
VideoAttentionTarget
VideoAttentionTarget is a video-based gaze target dataset comprising 71,666 frames from 1,331 clips. -
GazeFollow
GazeFollow is a large-scale dataset consisting of 122,143 images with 130,339 annotations on head-target instances. -
GazeHTA: End-to-end Gaze Target Detection with Head-Target Association
Gaze target detection aims to directly associate individuals and their gaze targets within a single image or across multiple video frames. -
Total capture: A 3D deformation model for tracking faces, hands, and bodies
Dataset for tracking faces, hands, and bodies in videos. -
Video based reconstruction of 3D people models
Real-world dataset for reconstructing 3D people models from monocular video. -
HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Tex...
Real-world datasets for reconstructing human avatars from monocular video, including ZJU-MoCap and People-Snapshot.