Instructional Video Analysis - Groups

HowTo100M

The dataset used in the LORD framework for autonomous driving, consisting of images, videos, and text-based observations.

Dataset
JSON

CrossTask

The CrossTask dataset contains 2,750 instructional videos, annotated for 133 keystep labels spanning 18 tasks.

Dataset
JSON

COIN

The COIN dataset is a large-scale instructional video dataset that contains 100 hours of video. The dataset is used for instructional video analysis and understanding.

Dataset
JSON

3 datasets found

HowTo100M

CrossTask

COIN