-
Words2Contact
The Words2Contact dataset contains verbal instructions for humanoid robots to place support contacts. -
CLEVR-Robot Environment
A benchmark for evaluating task compositionality and long-horizon tasks through object manipulation, with language serving as the mechanism for goal specification. -
LIMP Dataset
The dataset used in the paper is a set of 35 complex and ambiguous object goal navigation and mobile pick-and-place instructions. -
Crowd-sourced Language Annotations Dataset
The dataset consists of 5,600 episode-instruction pairs, where each episode is labeled with two hindsight instructions each. -
Data-driven Instruction Augmentation for Language-conditioned Control
Data-driven Instruction Augmentation for Language-conditioned Control (DIAL) is a method that uses pre-trained vision-language models (VLMs) to label offline datasets for... -
Vision-and-Language Navigation
The Vision-and-Language Navigation (VLN) task gives a global natural sentence I = {w0,..., wl} as an instruction, where wi is a word token while the l is the length of the... -
Deep Compositional Robotic Planners
A dataset for training a compositional hierarchical recurrent network to follow natural language commands in continuous environments. -
PhotoBot: Reference-Guided Interactive Photography via Natural Language
PhotoBot is a framework for fully automated photo acquisition based on an interplay between high-level human language guidance and a robot photographer. -
Validation Dataset
The Validation Dataset is used for validation, it contains 1428 images from nine distinct rooms.