中国科学院机构知识库网格系统: A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance

文献类型：会议论文


作者	Xu YF(徐一凡); Pu ZQ(蒲志强); Wu SG(吴士广); Liu BY(刘博寅); Yi JQ(易建强); Geng HJ(耿虎军); Chai XH(柴兴华)
出版日期	2022-07
会议日期	2022-2
会议地点	online
英文摘要	Multi-target coverage with connectivity maintenance at scale remains challenging in current research. A novel double-observation pol- icy learning framework (DOPLF) aiming at multi-agent system deploy- ment in large scale coverage problems is proposed in this work. DOPLF introduces observations from both global and local perspectives to en- courage more e cient exploration in large scale coverage scenarios with massive state spaces. Specifically, the local-level observation is derived from target partition to provide regional target density for each agent, and global-level observation provides an overall information of the en- vironment. Both observations are then fed into the subsequent learn- ing modules that primarily adopt graph attention network and proximal policy optimization based reinforcement learning algorithm to generate a distributed policy. Further, curriculum learning is applied to enhance the model adaptability in larger team sizes. Finally, the proposed method outperforms the baseline method in coverage rate and training e ciency in simulations with the number of targets ranging from 50 to 500.
源URL	[http://ir.ia.ac.cn/handle/173211/57457]
专题	综合信息系统研究中心_飞行器智能技术
作者单位	Institute of Automation, Chinese Academy of Sciences
推荐引用方式 GB/T 7714	Xu YF,Pu ZQ,Wu SG,et al. A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance[C]. 见:. online. 2022-2.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。