3 datasets found

Tags: keystep recognition

Filter Results
  • HowTo100M

    The dataset used in the LORD framework for autonomous driving, consisting of images, videos, and text-based observations.
  • CrossTask

    The CrossTask dataset contains 2,750 instructional videos, annotated for 133 keystep labels spanning 18 tasks.
  • COIN

    The COIN dataset is a large-scale instructional video dataset that contains 100 hours of video. The dataset is used for instructional video analysis and understanding.
You can also access this registry using the API (see API Docs).