Prompt-based Environmental Self-exploration
Vision-language navigation (VLN) is a challenging task due to its large searching space in the environment. To address this problem, previous works have proposed some methods of fine-tuning a large model pretrained on large-scale datasets.
BibTex: