- Affordance-centric Question-driven Task Completion
Affordance-centric Question-driven Task Completion is a task in which an AI assistant learns from instructional videos to provide step-by-step help in the user’s view.
- INRIA YouTube Instructional Videos
The INRIA YouTube Instructional Videos dataset covers five tasks from different instructional domains: “making coffee”, “changing a car tire”, “CPR”, “jumping a car”, and...
- NIV dataset
The NIV dataset, as used in the paper, contains 150 instructional videos depicting 5 daily tasks, with an average of 9.5 actions per video.
- CrossTask dataset
The CrossTask dataset, as used in the paper, contains 2,750 instructional videos spanning 18 different tasks, with an average of 7.6 actions per video.
- COIN dataset
The COIN dataset, as used in the paper, is a large-scale instructional video dataset containing 11,827 videos across 180 different tasks, with an average of 3.6 actions per video.
- Cross-task weakly supervised learning from instructional videos
A weakly supervised action segmentation dataset.