- NIV dataset
The NIV dataset used in the paper comprises 150 instructional videos depicting 5 daily tasks, with an average of 9.5 actions per video.
- CrossTask dataset
The CrossTask dataset used in the paper comprises 2,750 instructional videos spanning 18 tasks, with an average of 7.6 actions per video.