Video Action Localization - Groups

AVA v2.2

The AVA v2.2 dataset for spatiotemporal action localization contains the bounding box annotations and the corresponding action labels on keyframes.
- Dataset
- JSON
CrossTask

The CrossTask dataset contains 2,750 instructional videos, annotated for 133 keystep labels spanning 18 tasks.
- Dataset
- JSON

Before browse our site, please accept our cookies policy

2 datasets found