中国科学院机构知识库网格系统: Hierarchical Cooperative Swarm Policy Learning with Role Emergence

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Hierarchical Cooperative Swarm Policy Learning with Role Emergence

文献类型：会议论文


作者	Zhang TL(张天乐)1,2 ; Liu Z(刘振)1,2 ; Pu ZQ(蒲志强)1,2 ; Qiu TH(丘腾海)1,2 ; Yi JQ(易建强)1,2
出版日期	2021
会议日期	05-07 December 2021
会议地点	Online
英文摘要	Swarm systems can cooperatively and efficiently accomplish specified complex tasks. Recent works have shown the potential of multi-agent reinforcement learning methods to study behavior policies of swarm systems. However, it is difficult for them to complete complex swarm tasks efficiently. In human society, role assignment can effectively help humans understand complex tasks and decompose them into simple certain subtasks. Inspired by this, we propose a two-level hierarchical cooperative swarm policy learning framework with role emergence based on hierarchical deep reinforcement learning for distributed swarm systems. In this framework, roles are dynamic and emergent. Agents with the same role tend to collectively complete a certain subtask. Specifically, each agent uses a higher-level swarm policy to dynamically select a role for itself in a role space and at a higher temporal scale, while it uses a lower-level swarm policy to perform the responsibilities of the selected role in a primitive action space. Meanwhile, hierarchical swarm policies with partial observation are centrally trained and decentrally executed, where agents’ local interaction modules and extrinsic team rewards are designed to promote cooperation among agents. In addition, an intrinsic reward is defined to enable different roles to be identified by agents’ longer-term behaviors, which implicitly associates the roles with responsibilities. Simulation results show that our method can learn and generate emergent, dynamic and identifiable roles, which helps swarm systems to reliably and efficiently accomplish complex tasks in a shorter time.
会议录出版者	IEEE
语种	英语
源URL	[http://ir.ia.ac.cn/handle/173211/51964]
专题	综合信息系统研究中心_飞行器智能技术
通讯作者	Liu Z(刘振)
作者单位	1.中国科学院自动化研究所 2.中国科学院大学人工智能学院
推荐引用方式 GB/T 7714	Zhang TL,Liu Z,Pu ZQ,et al. Hierarchical Cooperative Swarm Policy Learning with Role Emergence[C]. 见:. Online. 05-07 December 2021.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。