中国科学院机构知识库网格系统: Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems

文献类型：会议论文


作者	Qingxu Fu1,2 ; Tenghai Qiu2 ; Jianqiang Yi1,2 ; Zhiqiang Pu1,2 ; Shiguang Wu1,2
出版日期	2022
会议日期	2022
会议地点	online
英文摘要	When dealing with a series of imminent issues, humans can naturally concentrate on a subset of these concerning issues by prioritizing them according to their contributions to motivational indices, e.g., the probability of winning a game. This idea of concentration offers insights into reinforcement learning of sophisticated Large-scale Multi-Agent Systems (LMAS) participated by hundreds of agents. In such an LMAS, each agent receives a long series of entity observations at each step, which can overwhelm existing aggregation networks such as graph attention networks and cause inefficiency. In this paper, we propose a concentration network called ConcNet. First, ConcNet scores the observed entities considering several motivational indices, e.g., expected survival time and state value of the agents, and then ranks, prunes, and aggregates the encodings of observed entities to extract features. Second, distinct from the well-known attention mechanism, ConcNet has a unique motivational subnetwork to explicitly consider the motivational indices when scoring the observed entities. Furthermore, we present a concentration policy gradient architecture that can learn effective policies in LMAS from scratch. Extensive experiments demonstrate that the presented architecture has excellent scalability and flexibility, and significantly outperforms existing methods on LMAS benchmarks.
源URL	[http://ir.ia.ac.cn/handle/173211/57209]
专题	综合信息系统研究中心_飞行器智能技术
通讯作者	Tenghai Qiu
作者单位	1.School of Artificial Intelligence, University of Chinese Academy of Sciences 2.Institute of Automation, Chinese Academy of Sciences
推荐引用方式 GB/T 7714	Qingxu Fu,Tenghai Qiu,Jianqiang Yi,et al. Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems[C]. 见:. online. 2022.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。