中国科学院机构知识库网格系统: Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions

文献类型：期刊论文


作者	Qingxu Fu1,2 ; Zhiqiang Pu1,2 ; Yi Pan 1,2; Tenghai Qiu1,2 ; Jianqiang Yi1,2
刊名	IEEE Transactions on Fuzzy Systems
出版日期	2024
页码	1
英文摘要	A large proportion of recent studies on cooperative Multi-Agent Reinforcement Learning (MARL) focus on the policy-learning process in scenarios with stationary opponents (or without opponents). This paper, instead, investigates a different challenge of achieving team superiority in dynamic competitions among competitors that evolve dynamically with MARL. We aim to enhance the competitiveness of such MARL learners by enabling them to adjust their own learning settings dynamically, so as to take quick counter-measures against the policy shift of competitor learners, or to learn faster to suppress the opponents. We propose a Competitive Auto-Multiagent Learner with Fuzzy Feedback (CALF) with two essential highlights: (1) CALF establishes feedback controllers to achieve real-time adjustments based on fuzzy logic, using human-readable fuzzy rules to provide significant explainability and flexibility; (2) CALF integrates Bayesian Optimization to search and optimize the feedback fuzzy logic rules automatically. CALF can be used to apply real-time adjustments for MARL hyperparameters and intrinsic rewards. We also give solid empirical results to show that CALF significantly promotes team competitiveness in adversarial competitions, spanning from small-scale tasks involving 2 teams to large-scale tasks involving 3 teams and hundreds of agents. Furthermore, CALF exhibits superior competitiveness when engaging in competition with established competitors like Qmix, Qtran, and Qplex in dynamic competitive environments. Moreover, the experiments also demonstrate that the integration of the fuzzy logic with Bayesian Optimization offers considerable transferability and explainability, enabling a CALF-implemented learner optimized from one scenario to be transferred to other distinct scenarios.
源URL	[http://ir.ia.ac.cn/handle/173211/57223]
专题	综合信息系统研究中心_飞行器智能技术
作者单位	1.中国科学院自动化研究所 2.中国科学院大学
推荐引用方式 GB/T 7714	Qingxu Fu,Zhiqiang Pu,Yi Pan,et al. Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions[J]. IEEE Transactions on Fuzzy Systems,2024:1.
APA	Qingxu Fu,Zhiqiang Pu,Yi Pan,Tenghai Qiu,&Jianqiang Yi.(2024).Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions.IEEE Transactions on Fuzzy Systems,1.
MLA	Qingxu Fu,et al."Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions".IEEE Transactions on Fuzzy Systems (2024):1.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。