中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Path Planning of Multiagent Constrained Formation through Deep Reinforcement Learning

文献类型:会议论文

作者Sui Zezhi1,2; Pu Zhiqiang1,2; Yi Jianqiang1,2; Tan Xiangmin1,2
出版日期2018-07
会议日期July 8-13, 2018
会议地点Rio de Janeiro, Brazil
英文摘要

A parallel deep Q-network (DQN) algorithm is presented for solving multiagent constrained formation path planning, where reaching destination, avoiding obstacles, and maintaining formation are simultaneously considered as independent or interactive tasks. Parallel Q-networks are utilized for each agent to sense different feature information and learn independent behavior policy. Comprehensive reward function is designed in consideration of respective requirements and interaction constraints to correctly guide the training. In order to demonstrate the effectiveness of the algorithm, we build an end-to-end model by designing a pixel game. Both training and testing are carried out in the game with double dueling DQN and the results show that the parallel deep Q-network path planner eventually complete the three tasks very well.

会议录出版者Institute of Electrical and Electronics Engineers Inc
语种英语
源URL[http://ir.ia.ac.cn/handle/173211/39696]  
专题自动化研究所_综合信息系统研究中心
作者单位1.Institute of Automation, Chinese Academy of Sciences Beijing, 100190,China
2.University of Chinese Academy of Sciences Beijing, 100049, China
推荐引用方式
GB/T 7714
Sui Zezhi,Pu Zhiqiang,Yi Jianqiang,et al. Path Planning of Multiagent Constrained Formation through Deep Reinforcement Learning[C]. 见:. Rio de Janeiro, Brazil. July 8-13, 2018.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。