Zero-shot Object Goal Visual Navigation
文献类型:会议论文
作者 | Qianfan Zhao1; Lu Zhang1![]() ![]() ![]() |
出版日期 | 2023-05 |
会议日期 | May 29 - June 2, 2023 |
会议地点 | London, UK |
英文摘要 | Object goal visual navigation is a challenging task that aims to guide a robot to find the target object based on its visual observation, and the target is limited to the classes pre-defined in the training stage. However, in real households, there may exist numerous target classes that the robot needs to deal with, and it is hard for all of these classes to be contained in the training stage. To address this challenge, we study the zero-shot object goal visual navigation task, which aims at guiding robots to find targets belonging to novel classes without any training samples. To this end, we also propose a novel zero-shot object navigation framework called semantic similarity network (SSNet). Our framework use the detection results and the cosine similarity between semantic word embeddings as input. Such type of input data has a weak correlation with classes and thus our framework has the ability to generalize the policy to novel classes. Extensive experiments on the AI2-THOR platform show that our model outperforms the baseline models in the zero-shot object navigation task, which proves the generalization ability of our model. Our code is available at: https://github.com/pioneer-innovation/Zero-Shot-Object-Navigation. |
会议录出版者 | IEEE |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/57276] ![]() |
专题 | 多模态人工智能系统全国重点实验室 |
通讯作者 | Zhiyong Liu |
作者单位 | 1.State Key Laboratory of Multimodal Artifi cial Intelligence System, Institute of Automation, Chinese Academy of Sciences 2.Nanjing Artifi cial Intelligence Research of IA, Jiangning District 3.College of Electronic and Information Engineering, Tongji University |
推荐引用方式 GB/T 7714 | Qianfan Zhao,Lu Zhang,Bin He,et al. Zero-shot Object Goal Visual Navigation[C]. 见:. London, UK. May 29 - June 2, 2023. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。