Instructional Videos - Groups

HowToStep

The HowToStep dataset is a large-scale instructional dataset constructed for training, by transforming the original transcripts of HTM-370K into around 4M ordered instructional...

Dataset
JSON

How2Sign

How2Sign is a large-scale continuous American Sign Language (ASL) dataset. After removing invalid text-video pairs, we retain 31019, 1738, and 2348 available pairs in the...

Dataset
JSON

INRIA YouTube Instructional Videos

The INRIA YouTube Instructional Videos dataset contains five tasks of different instructional domains: “making coffee”, “changing a car tire”, “CPR”, “jumping a car”, and...

Dataset
JSON

HowTo100M

The dataset used in the LORD framework for autonomous driving, consisting of images, videos, and text-based observations.

Dataset
JSON

CrossTask

The CrossTask dataset contains 2,750 instructional videos, annotated for 133 keystep labels spanning 18 tasks.

Dataset
JSON

COIN

The COIN dataset is a large-scale instructional video dataset that contains 100 hours of video. The dataset is used for instructional video analysis and understanding.

Dataset
JSON