Iterative Room-to-Room

Iterative Room-to-Room (IR2R) benchmark for evaluating language-guided agents navigating in a persistent environment over time.

BibTex: