Learning Heterogeneous Agent Cooperation via Multiagent League Training
文献类型:期刊论文
作者 | Qingxu, Fu1,2![]() ![]() ![]() ![]() ![]() |
刊名 | IFAC World Congress
![]() |
出版日期 | 2023 |
页码 | IFAC PapersOnLine 56-2 (2023) 3033-3040 |
英文摘要 | Many multiagent systems in the real world include multiple types of agents with different abilities and functionality. Such heterogeneous multiagent systems have significant practical advantages. However, they also come with challenges compared with homogeneous systems for multiagent reinforcement learning, such as the non-stationary problem and the policy version iteration issue. This work proposes a general-purpose reinforcement learning algorithm named Heterogeneous League Training (HLT) to address heterogeneous multiagent problems. HLT keeps track of a pool of policies that agents have explored during training, gathering a league of heterogeneous policies to facilitate future policy optimization. Moreover, a hyper-network is introduced to increase the diversity of agent behaviors when collaborating with teammates having different levels of cooperation skills. We use heterogeneous benchmark tasks to demonstrate that (1) HLT promotes the success rate in cooperative heterogeneous tasks; (2) HLT is an effective approach to solving the policy version iteration problem; (3) HLT provides a practical way to assess the difficulty of learning each role in a heterogeneous team. |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/57220] ![]() |
专题 | 综合信息系统研究中心_飞行器智能技术 |
作者单位 | 1.中国科学院自动化研究所 2.中国科学院大学 |
推荐引用方式 GB/T 7714 | Qingxu, Fu,Xiaolin Ai,Jianqiang Yi,et al. Learning Heterogeneous Agent Cooperation via Multiagent League Training[J]. IFAC World Congress,2023:IFAC PapersOnLine 56-2 (2023) 3033-3040. |
APA | Qingxu, Fu,Xiaolin Ai,Jianqiang Yi,Tenghai Qiu,Wanmai Yuan,&Zhiqiang Pu.(2023).Learning Heterogeneous Agent Cooperation via Multiagent League Training.IFAC World Congress,IFAC PapersOnLine 56-2 (2023) 3033-3040. |
MLA | Qingxu, Fu,et al."Learning Heterogeneous Agent Cooperation via Multiagent League Training".IFAC World Congress (2023):IFAC PapersOnLine 56-2 (2023) 3033-3040. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。