QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning
Document type: Journal article
Author | Liu BY (刘博寅) |
Journal | IEEE Transactions on Cognitive and Developmental Systems |
Publication date | 2024 |
Pages | 12 |
Abstract | In multi-agent reinforcement learning (MARL), agents must learn to cooperate by observing the environment and selecting actions that maximize their rewards. However, this learning process can be hampered by myopia, wherein agents' strategies fail to consider the long-term consequences of their actions. A primary reason for this problem is the inaccurate estimation of the long-term value of each action. Socially, humans derive future expectation cognition from available information to anticipate potential future outcomes and adjust their actions accordingly to avoid myopia. Motivated by these insights, this paper proposes a novel framework called QFuture to address the myopia problem. Specifically, we first design a future expectation cognition module (FECM) in this framework to build future expectation cognition in the calculation of individual action-value (IAV) and joint action-value (JAV). We model future expectation cognition as random variables in FECM, which learn representation by maximizing mutual information with the future trajectory based on current information. Furthermore, a return-based regularizer is designed to reflect "expectation" and ensure informativeness in the future expectation representation module (FERM) which encodes the future trajectory. Experiments on StarCraft II micromanagement tasks and Google Research Football show that QFuture achieves significant state-of-the-art performance. Demonstrative videos are available at https://sites.google.com/view/qfuture. |
Source URL | http://ir.ia.ac.cn/handle/173211/58536 |
Collection | Laboratory of Cognition and Decision Intelligence for Complex Systems_Group Decision Intelligence Team |
Recommended citation, GB/T 7714 | Liu BY. QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning[J]. IEEE Transactions on Cognitive and Developmental Systems, 2024: 12. |
APA | Liu BY. (2024). QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning. IEEE Transactions on Cognitive and Developmental Systems, 12. |
MLA | Liu BY. "QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning." IEEE Transactions on Cognitive and Developmental Systems (2024): 12. |
Ingestion method: OAI harvesting
Source: Institute of Automation