Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning
文献类型:会议论文
作者 | Xingyuan Gao1,2![]() ![]() ![]() |
出版日期 | 2019-09 |
会议日期 | 2019-08-22至2019-08-26 |
会议地点 | Vancouver, British Columbia, Canada |
英文摘要 | Navigation in an efficient and socially normative manner is essential for the robot to operate in human populated environments. Traditional methods treat the pedestrians as dynamic obstacles and design a manual cost function for collision avoidance, but neglect social norms in navigation and do not generalize well to new environments. In this paper, we propose a mixture model to capture the human navigation behaviors in terms of the features of the continuous trajectories and discrete navigation decisions, such as passing on the left or right. The lower level of the model aims to generate socially normative trajectories. To this end, we extend inverse reinforcement learning (IRL) framework to a motion planner called Timed Elastic Band to learn from demonstrations. The upper level comprises a discrete distribution over the homotopy classes of the trajectories. IRL algorithm is employed to find the parameters of distribution that match demonstrations best. Experiments demonstrate that our learning algorithm has the capacity to recover the human navigation behaviors that respect social norms, which makes our approach outperform state-of-the-art methods in social navigation scenarios. |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/57461] ![]() |
专题 | 智能机器人系统研究 |
作者单位 | 1.University of Chinese Academy of Sciences 2.State Key Laboratory of Management and Control for Complex Systems, Institute of Automation, Chinese Academy of Sciences |
推荐引用方式 GB/T 7714 | Xingyuan Gao,Xiaoguang Zhao,Min Tan. Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning[C]. 见:. Vancouver, British Columbia, Canada. 2019-08-22至2019-08-26. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。