2 datasets found

Tags: video action localization

Filter Results
  • AVA v2.2

    The AVA v2.2 dataset for spatiotemporal action localization contains the bounding box annotations and the corresponding action labels on keyframes.
  • CrossTask

    The CrossTask dataset contains 2,750 instructional videos, annotated for 133 keystep labels spanning 18 tasks.
You can also access this registry using the API (see API Docs).