中国科学院机构知识库网格系统: Path Planning of Multiagent Constrained Formation through Deep Reinforcement Learning

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Path Planning of Multiagent Constrained Formation through Deep Reinforcement Learning

文献类型：会议论文


作者	Sui Zezhi1,2 ; Pu Zhiqiang1,2 ; Yi Jianqiang1,2 ; Tan Xiangmin1,2
出版日期	2018-07
会议日期	July 8-13, 2018
会议地点	Rio de Janeiro, Brazil
英文摘要	A parallel deep Q-network (DQN) algorithm is presented for solving multiagent constrained formation path planning, where reaching destination, avoiding obstacles, and maintaining formation are simultaneously considered as independent or interactive tasks. Parallel Q-networks are utilized for each agent to sense different feature information and learn independent behavior policy. Comprehensive reward function is designed in consideration of respective requirements and interaction constraints to correctly guide the training. In order to demonstrate the effectiveness of the algorithm, we build an end-to-end model by designing a pixel game. Both training and testing are carried out in the game with double dueling DQN and the results show that the parallel deep Q-network path planner eventually complete the three tasks very well.
会议录出版者	Institute of Electrical and Electronics Engineers Inc
语种	英语
源URL	[http://ir.ia.ac.cn/handle/173211/39696]
专题	自动化研究所_综合信息系统研究中心
作者单位	1.Institute of Automation, Chinese Academy of Sciences Beijing, 100190,China 2.University of Chinese Academy of Sciences Beijing, 100049, China
推荐引用方式 GB/T 7714	Sui Zezhi,Pu Zhiqiang,Yi Jianqiang,et al. Path Planning of Multiagent Constrained Formation through Deep Reinforcement Learning[C]. 见:. Rio de Janeiro, Brazil. July 8-13, 2018.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。