-
Using Large Language Models to Simulate Multiple Humans
The dataset used in the paper to simulate human behavior in various experiments, including the Ultimatum Game, Garden Path Sentences, Milgram Shock Experiment, and Wisdom of... -
Voice-based 3D modeling for novices
The dataset used in the Wizard of Oz study to explore novice mental models in voice-based 3D modeling. -
FashionEngine: Interactive 3D Human Generation and Editing via Multimodal Con...
FashionEngine is an interactive 3D human generation and editing system that enables easy and efficient production of 3D digital humans with multimodal control. -
Lunar Lander
A collection of data points from a lunar lander, used in the paper to test the proposed APG algorithm for task switching. -
CLIP-Actor: Text-Driven Recommendation and Stylization for Animating Human Me...
A text-driven animated human mesh synthesis system that leverages multi-modal aware and semantic textual matching. -
Human Activity Recognition (HAR) dataset
The dataset used in this paper poses a multiclass classification task: predict which of 7 activities the user is performing. The... -
RF-Capture
RF-Capture: Capturing the Human Figure Through a Wall -
EGTEA Gaze+
The EGTEA Gaze+ dataset offers approximately 10,000 samples of 106 non-scripted daily activities that occur in a kitchen. -
VideoAttentionTarget
VideoAttentionTarget is a video-based gaze target dataset comprising 71,666 frames from 1,331 clips. -
GazeFollow
GazeFollow is a large-scale dataset consisting of 122,143 images with 130,339 annotations on head-target instances. -
GazeHTA: End-to-end Gaze Target Detection with Head-Target Association
Gaze target detection aims to directly associate individuals and their gaze targets within a single image or across multiple video frames. -
Total capture: A 3D deformation model for tracking faces, hands, and bodies
Dataset for tracking faces, hands, and bodies in videos. -
Video based reconstruction of 3D people models
Real-world dataset for reconstructing 3D people models from monocular video. -
HR Human: Modeling Human Avatars with Triangular Mesh and High-Resolution Tex...
Real-world datasets for reconstructing human avatars from monocular video, including ZJU-MoCap and People-Snapshot. -
Training a helpful and harmless assistant with reinforcement learning from hu...
The authors propose a novel approach that incorporates parameter-efficient tuning to better optimize control tokens, thus benefitting controllable generation.