Dataset - LDM

HowToStep

The HowToStep dataset is a large-scale instructional dataset built for training by transforming the original transcripts of HTM-370K into around 4M ordered instructional...
- Dataset
- JSON
Affordance-centric Question-driven Task Completion

Affordance-centric Question-driven Task Completion is a new task in which an AI assistant learns from instructional videos to provide step-by-step help in the user’s view.
- Dataset
- JSON
INRIA YouTube Instructional Videos

The INRIA YouTube Instructional Videos dataset contains five tasks of different instructional domains: “making coffee”, “changing a car tire”, “CPR”, “jumping a car”, and...
- Dataset
- JSON
NIV dataset

The NIV dataset contains 150 instructional videos depicting 5 daily tasks, with an average of 9.5 actions per video.
- Dataset
- JSON
CrossTask dataset

The CrossTask dataset contains 2,750 instructional videos spanning 18 different tasks, with an average of 7.6 actions per video.
- Dataset
- JSON
COIN dataset

The COIN dataset is a large-scale instructional video dataset containing 11,827 videos across 180 different tasks, with an average of 3.6 actions per video.
- Dataset
- JSON
HowTo100M

HowTo100M is a large-scale dataset of narrated instructional videos, containing about 136M captioned video clips drawn from roughly 1.2M YouTube videos.
- Dataset
- JSON
CrossTask

The CrossTask dataset contains 2,750 instructional videos, annotated for 133 keystep labels spanning 18 tasks.
- Dataset
- JSON
Cross-task weakly supervised learning from instructional videos

A dataset for weakly supervised action segmentation in instructional videos.
- Dataset
- JSON
COIN

The COIN dataset is a large-scale instructional video dataset that contains 100 hours of video. The dataset is used for instructional video analysis and understanding.
- Dataset
- JSON

You can also access this registry using the API (see API Docs).
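As a rough illustration of API access, the sketch below builds a search request and extracts dataset titles from the response. It assumes the registry exposes a CKAN-style Action API with a `package_search` endpoint, which this page does not confirm. The base URL is a placeholder, and the helper names `build_search_url`, `extract_titles`, and `search_datasets` are hypothetical. Consult the API Docs for the actual endpoints.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Placeholder address (assumption): replace with the registry's real base URL.
BASE_URL = "https://registry.example.org"


def build_search_url(query, rows=10, base_url=BASE_URL):
    """Build a search URL, assuming a CKAN-style package_search endpoint."""
    params = urlencode({"q": query, "rows": rows})
    return f"{base_url}/api/3/action/package_search?{params}"


def extract_titles(payload):
    """Return dataset titles from a decoded package_search-style response."""
    if not payload.get("success"):
        return []
    results = payload["result"]["results"]
    # Fall back to the machine-readable name when no title is present.
    return [d.get("title", d.get("name", "")) for d in results]


def search_datasets(query, rows=10, base_url=BASE_URL):
    """Query the registry over HTTP and return matching dataset titles."""
    with urlopen(build_search_url(query, rows, base_url), timeout=30) as resp:
        return extract_titles(json.load(resp))
```

With a real base URL in place, a call such as `search_datasets("instructional videos")` would return a list of titles like those listed on this page, provided the assumed endpoint exists.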

10 datasets found