1 dataset found

Groups: Vision and Language
Organizations: No Organization
Formats: JSON

  • Room-to-Room (R2R) dataset

    The Room-to-Room (R2R) dataset is a benchmark for vision-and-language navigation tasks. It consists of 7,189 paths sampled from its navigation graphs, each with three...