中国科学院机构知识库网格系统: Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration

文献类型：期刊论文


作者	Zezhi Sui2,3 ; Zhiqiang Pu1,2,3 ; Jianqiang Yi1,2,3 ; Shiguang Wu2,3
刊名	IEEE Transactions on Neural Networks and Learning Systems
出版日期	2021
卷号	32 期号:6 页码:2358-2372
关键词	Collision avoidance deep reinforcement learning (DRL) formation control leader–follower
DOI	10.1109/TNNLS.2020.3004893
英文摘要	Generating collision-free, time-efficient paths in an uncertain dynamic environment poses huge challenges for the formation control with collision avoidance (FCCA) problem in a leader–follower structure. In particular, the followers have to take both formation maintenance and collision avoidance into account simultaneously. Unfortunately, most of the existing works are simple combinations of methods dealing with the two problems separately. In this article, a new method based on deep reinforcement learning (RL) is proposed to solve the problem of FCCA. Especially, the learning-based policy is extended to the field of formation control, which involves a two-stage training framework: an imitation learning (IL) and later an RL. In the IL stage, a model-guided method consisting of a consensus theory-based formation controller and an optimal reciprocal collision avoidance strategy is designed to speed up training and increase efficiency. In the RL stage, a compound reward function is presented to guide the training. In addition, we design a formation-oriented network structure to perceive the environment. Long short-term memory is adopted to enable the network structure to perceive the information of obstacles of an uncertain number, and a transfer training approach is adopted to improve the generalization of the network in different scenarios. Numerous representative simulations are conducted, and our method is further deployed to an experimental platform based on a multiomnidirectional-wheeled car system. The effectiveness and practicability of our proposed method are validated through both the simulation and experiment results.
URL标识	查看原文
语种	英语
源URL	[http://ir.ia.ac.cn/handle/173211/47423]
专题	综合信息系统研究中心_飞行器智能技术
通讯作者	Zhiqiang Pu
作者单位	1.Taizhou Institute of Intelligent Manufacturing 2.School of Artificial Intelligence, University of Chinese Academy of Sciences 3.Institute of Automation, Chinese Academy of Sciences
推荐引用方式 GB/T 7714	Zezhi Sui,Zhiqiang Pu,Jianqiang Yi,et al. Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration[J]. IEEE Transactions on Neural Networks and Learning Systems,2021,32(6):2358-2372.
APA	Zezhi Sui,Zhiqiang Pu,Jianqiang Yi,&Shiguang Wu.(2021).Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration.IEEE Transactions on Neural Networks and Learning Systems,32(6),2358-2372.
MLA	Zezhi Sui,et al."Formation control with collision avoidance through deep reinforcement learning using model-guided demonstration".IEEE Transactions on Neural Networks and Learning Systems 32.6(2021):2358-2372.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。