1 dataset found

Tags: very long-form

  • EgoSchema

    EgoSchema is a diagnostic benchmark for assessing very long-form video-language understanding capabilities of modern multimodal systems.
You can also access this registry using the API (see API Docs).