中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance

文献类型:会议论文

作者Xu YF(徐一凡); Pu ZQ(蒲志强); Wu SG(吴士广); Liu BY(刘博寅); Yi JQ(易建强); Geng HJ(耿虎军); Chai XH(柴兴华)
出版日期2022-07
会议日期2022-2
会议地点online
英文摘要

Multi-target coverage with connectivity maintenance at scale
remains challenging in current research. A novel double-observation pol-
icy learning framework (DOPLF) aiming at multi-agent system deploy-
ment in large scale coverage problems is proposed in this work. DOPLF
introduces observations from both global and local perspectives to en-
courage more e cient exploration in large scale coverage scenarios with
massive state spaces. Specifically, the local-level observation is derived
from target partition to provide regional target density for each agent,
and global-level observation provides an overall information of the en-
vironment. Both observations are then fed into the subsequent learn-
ing modules that primarily adopt graph attention network and proximal
policy optimization based reinforcement learning algorithm to generate
a distributed policy. Further, curriculum learning is applied to enhance
the model adaptability in larger team sizes. Finally, the proposed method
outperforms the baseline method in coverage rate and training e ciency
in simulations with the number of targets ranging from 50 to 500.

源URL[http://ir.ia.ac.cn/handle/173211/57457]  
专题综合信息系统研究中心_飞行器智能技术
作者单位Institute of Automation, Chinese Academy of Sciences
推荐引用方式
GB/T 7714
Xu YF,Pu ZQ,Wu SG,et al. A Double-Observation Policy Learning Framework for Multi-target Coverage with Connectivity Maintenance[C]. 见:. online. 2022-2.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。