中国科学院机构知识库网格系统: Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning

文献类型：会议论文


作者	Xingyuan Gao1,2 ; Xiaoguang Zhao2 ; Min Tan2
出版日期	2019-09
会议日期	2019-08-22至2019-08-26
会议地点	Vancouver, British Columbia, Canada
英文摘要	Navigation in an efficient and socially normative manner is essential for the robot to operate in human populated environments. Traditional methods treat the pedestrians as dynamic obstacles and design a manual cost function for collision avoidance, but neglect social norms in navigation and do not generalize well to new environments. In this paper, we propose a mixture model to capture the human navigation behaviors in terms of the features of the continuous trajectories and discrete navigation decisions, such as passing on the left or right. The lower level of the model aims to generate socially normative trajectories. To this end, we extend inverse reinforcement learning (IRL) framework to a motion planner called Timed Elastic Band to learn from demonstrations. The upper level comprises a discrete distribution over the homotopy classes of the trajectories. IRL algorithm is employed to find the parameters of distribution that match demonstrations best. Experiments demonstrate that our learning algorithm has the capacity to recover the human navigation behaviors that respect social norms, which makes our approach outperform state-of-the-art methods in social navigation scenarios.
语种	英语
源URL	[http://ir.ia.ac.cn/handle/173211/57461]
专题	智能机器人系统研究
作者单位	1.University of Chinese Academy of Sciences 2.State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences
推荐引用方式 GB/T 7714	Xingyuan Gao,Xiaoguang Zhao,Min Tan. Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning[C]. 见:. Vancouver, British Columbia, Canada. 2019-08-22至2019-08-26.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。