7 datasets found

Tags: Multimodal Interaction

Filter Results
  • Voxicon

    Voxicon is a library of voxemes that represent objects, actions, and relations.
  • VoxML

    VoxML is a modeling language used to map natural language expressions into real-time visualizations using commonsense semantic knowledge of objects and events.
  • GTA: A Benchmark for General Tool Agents

    GTA is a benchmark for General Tool Agents, featuring three main aspects: real user queries, real deployed tools, and real multimodal inputs.
  • Beat

    Beat: A large-scale semantic and emotional multi-modal dataset for conversational gestures synthesis
  • Talking With Hands

    Talking With Hands 16.2 m: A large-scale dataset of synchronized body-finger motion and audio for conversational motion analysis and synthesis
  • GENEA Challenge 2023

    The GENEA Challenge 2023: A large-scale evaluation of gesture generation models in monadic and dyadic settings
  • DiffuseStyleGesture+

    The DiffuseStyleGesture+ entry to the GENEA Challenge 2023
You can also access this registry using the API (see API Docs).