Vision-and-Language Navigation - Groups - LDM

Room-to-Room (R2R) dataset

The Room-to-Room (R2R) dataset is a benchmark for vision-and-language navigation tasks. It consists of 7,189 paths sampled from its navigation graphs, each with three...
- Dataset
- JSON

Before browse our site, please accept our cookies policy