Title
Vision-Language Navigation Policy Learning and Adaptation
Abstract
Vision-language navigation (VLN) is the task of navigating an embodied agent to carry out natural language instructions inside real 3D environments. In this paper, we study how to address three critical challenges for this task: the cross-modal grounding, the ill-posed feedback, and the generalization problems. First, we propose a novel Reinforced Cross-Modal Matching (RCM) approach that enforces ...
Year
DOI
Venue
2021
10.1109/TPAMI.2020.2972281
IEEE Transactions on Pattern Analysis and Machine Intelligence
Keywords
DocType
Volume
Navigation,Visualization,Trajectory,Task analysis,Cognition,Grounding,Natural languages
Journal
43
Issue
ISSN
Citations 
12
0162-8828
0
PageRank 
References 
Authors
0.34
7
8
Name
Order
Citations
PageRank
Xin Wang100.34
Qiuyuan Huang217617.66
Asli Çelikyilmaz340739.06
Jianfeng Gao45729296.43
Dinghan Shen510810.37
Yuan-Fang Wang600.34
William Yang Wang749359.64
Lei Zhang82533164.29