DiscussNav

In this work, we introduce a novel zero-shot visual language navigation (VLN) framework in which large models with distinct abilities serve as domain experts. Our proposed navigation agent, DiscussNav, actively discusses with these experts to collect essential information before every move. These discussions cover critical navigation subtasks such as instruction understanding, environment perception, and completion estimation. Results on the representative VLN task R2R show that our method surpasses the leading zero-shot VLN model by a large margin on all metrics.
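
The discuss-then-move loop described above can be pictured with a minimal sketch. It is only an illustration under stated assumptions and is not the code in DiscussNav.py: the `DiscussionAgent` class, the `run_episode` helper, the expert names, and the stub experts are all hypothetical, and each real expert would be a large model rather than a lambda.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical expert interface: a function from a text query to a text answer.
Expert = Callable[[str], str]


@dataclass
class DiscussionAgent:
    """Toy agent that consults one expert per navigation subtask before acting."""
    experts: Dict[str, Expert]
    history: List[str] = field(default_factory=list)

    def step(self, instruction: str, observation: str) -> str:
        # 1. Instruction understanding: ask what the instruction requires.
        plan = self.experts["instruction"](instruction)
        # 2. Environment perception: ask what the current view contains.
        scene = self.experts["perception"](observation)
        # 3. Completion estimation: ask whether the goal has been reached.
        query = f"plan: {plan} | scene: {scene} | visited: {len(self.history)}"
        status = self.experts["completion"](query)
        self.history.append(scene)
        return "stop" if status == "done" else "move"


def run_episode(agent: DiscussionAgent, instruction: str,
                observations: List[str], max_steps: int = 10) -> List[str]:
    """Feed observations to the agent one by one until it decides to stop."""
    actions = []
    for observation in observations[:max_steps]:
        action = agent.step(instruction, observation)
        actions.append(action)
        if action == "stop":
            break
    return actions


# Stub experts (assumptions for illustration only).
STUB_EXPERTS: Dict[str, Expert] = {
    "instruction": lambda text: text.lower(),
    "perception": lambda view: view,
    "completion": lambda q: "done" if "scene: bedroom" in q else "ongoing",
}

if __name__ == "__main__":
    agent = DiscussionAgent(experts=STUB_EXPERTS)
    views = ["hallway", "kitchen", "bedroom", "garage"]
    print(run_episode(agent, "Go to the bedroom", views))
```

With the stub experts, the agent moves through the hallway and kitchen and stops once the perceived scene is the bedroom, so the garage view is never consulted.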

Requirements

Ubuntu 18.04.6 LTS

Python 3.8.17

Torch 1.13.1

Matterport3DSimulator

Recognize Anything (RAM)

InstructBLIP

R2R Data

We have prepared R2R Val Unseen data in the tasks/data directory.
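
To inspect the prepared data before running the agent, something like the following sketch may help. It assumes the files under tasks/data are JSON and follow the usual R2R layout, a list of episode records with `scan` and `instructions` fields. Both points are assumptions, since the file names and schema are not stated here, and the helper names `summarize_episodes` and `load_json_files` are hypothetical.

```python
import json
from pathlib import Path


def summarize_episodes(episodes):
    """Count episodes, instructions, and distinct scans in R2R-style records."""
    n_instructions = sum(len(ep.get("instructions", [])) for ep in episodes)
    scans = {ep["scan"] for ep in episodes if "scan" in ep}
    return {
        "episodes": len(episodes),
        "instructions": n_instructions,
        "scans": len(scans),
    }


def load_json_files(data_dir="tasks/data"):
    """Yield (path, parsed JSON) for every .json file found under data_dir."""
    for path in sorted(Path(data_dir).glob("**/*.json")):
        with path.open() as f:
            yield path, json.load(f)


if __name__ == "__main__":
    for path, data in load_json_files():
        # Only summarize files that look like a list of episode records.
        if isinstance(data, list) and all(isinstance(ep, dict) for ep in data):
            print(path, summarize_episodes(data))
```

If the directory holds files in another format, the loop simply prints nothing for them.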

Run DiscussNav

python DiscussNav.py

BibTeX

Please cite our paper if you find it helpful :)

@article{long2023discuss,
  title={Discuss before moving: Visual language navigation via multi-expert discussions},
  author={Long, Yuxing and Li, Xiaoqi and Cai, Wenzhe and Dong, Hao},
  journal={arXiv preprint arXiv:2309.11382},
  year={2023}
}
