Chinese Academy of Sciences Institutional Repositories Grid

Learning Heterogeneous Agent Cooperation via Multiagent League Training

Document type: Journal article


Authors	Qingxu Fu (1,2); Xiaolin Ai (1,2); Jianqiang Yi (1,2); Tenghai Qiu (1,2); Wanmai Yuan (1,2); Zhiqiang Pu (1,2)
Journal title	IFAC World Congress
Publication date	2023
Pages	IFAC PapersOnLine 56-2 (2023) 3033-3040
Abstract	Many multiagent systems in the real world include multiple types of agents with different abilities and functionality. Such heterogeneous multiagent systems have significant practical advantages. However, they also come with challenges compared with homogeneous systems for multiagent reinforcement learning, such as the non-stationary problem and the policy version iteration issue. This work proposes a general-purpose reinforcement learning algorithm named Heterogeneous League Training (HLT) to address heterogeneous multiagent problems. HLT keeps track of a pool of policies that agents have explored during training, gathering a league of heterogeneous policies to facilitate future policy optimization. Moreover, a hyper-network is introduced to increase the diversity of agent behaviors when collaborating with teammates having different levels of cooperation skills. We use heterogeneous benchmark tasks to demonstrate that (1) HLT promotes the success rate in cooperative heterogeneous tasks; (2) HLT is an effective approach to solving the policy version iteration problem; (3) HLT provides a practical way to assess the difficulty of learning each role in a heterogeneous team.
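Language	English
Source URL	[http://ir.ia.ac.cn/handle/173211/57220]
Collection	Integrated Information System Research Center_Intelligent Aircraft Technology
Affiliations	1. Institute of Automation, Chinese Academy of Sciences 2. University of Chinese Academy of Sciences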
Recommended citation GB/T 7714	Qingxu Fu, Xiaolin Ai, Jianqiang Yi, et al. Learning Heterogeneous Agent Cooperation via Multiagent League Training[J]. IFAC World Congress, 2023: IFAC PapersOnLine 56-2 (2023) 3033-3040.
APA	Qingxu Fu, Xiaolin Ai, Jianqiang Yi, Tenghai Qiu, Wanmai Yuan, & Zhiqiang Pu. (2023). Learning Heterogeneous Agent Cooperation via Multiagent League Training. IFAC World Congress, IFAC PapersOnLine 56-2 (2023) 3033-3040.
MLA	Qingxu Fu, et al. "Learning Heterogeneous Agent Cooperation via Multiagent League Training". IFAC World Congress (2023): IFAC PapersOnLine 56-2 (2023) 3033-3040.

Ingest method: OAI harvesting

Source: Institute of Automation

Unless otherwise stated, all content in this system is protected by copyright, and all rights are reserved.