2 datasets found

Tags: single entity

Filter Results
  • Room-to-Room (R2R) dataset

    The Room-to-Room (R2R) dataset is a benchmark for vision-and-language navigation tasks. It consists of 7,189 paths sampled from its navigation graphs, each with three...
  • REVERIE dataset

    The REVERIE dataset is a dataset of household tasks in an indoor environment. It contains images annotated with natural language instructions including the referring expressions...