3 datasets found

Groups: Keystep Recognition Formats: JSON

Filter Results
  • HowTo100M

    The dataset used in the LORD framework for autonomous driving, consisting of images, videos, and text-based observations.
  • CrossTask

    The CrossTask dataset contains 2,750 instructional videos, annotated for 133 keystep labels spanning 18 tasks.
  • COIN

    The COIN dataset is a large-scale instructional video dataset that contains 100 hours of video. The dataset is used for instructional video analysis and understanding.