Dataset - LDM

R4R Dataset

The R4R dataset is a larger VLN dataset than R2R, with more complicated navigation paths.
- Dataset
- JSON
R2R Dataset

The R2R dataset is built from real photos taken in indoor environments. It has attracted considerable attention for its simple task formulation, which at the same time requires complex...
- Dataset
- JSON
Room-to-Room

The Room-to-Room dataset is a photo-realistic dataset for vision-and-language navigation, where agents navigate through indoor environments based on natural language instructions.
- Dataset
- JSON
Vision-and-Language Navigation

The Vision-and-Language Navigation (VLN) task gives a global natural-language sentence I = {w_0, ..., w_l} as an instruction, where w_i is a word token and l is the length of the...
- Dataset
- JSON

You can also access this registry using the API (see API Docs).

4 datasets found