中国科学院机构知识库网格系统: QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning

文献类型：期刊论文


作者	Liu BY(刘博寅)
刊名	IEEE Transactions on Cognitive and Developmental Systems
出版日期	2024
页码	12
英文摘要	In multi-agent reinforcement learning (MARL), agents must learn to cooperate by observing the environment and selecting actions that maximize their rewards. However, this learning process can be hampered by myopia, wherein agents' strategies fail to consider the long-term consequences of their actions. A primary reason for this problem is the inaccurate estimation of the long-term value of each action. Socially, humans derive future expectation cognition from available information to anticipate potential future outcomes and adjust their actions accordingly to avoid myopia. Motivated by these insights, this paper proposes a novel framework called QFuture to address the myopia problem. Specifically, we first design a future expectation cognition module (FECM) in this framework to build future expectation cognition in the calculation of individual action-value (IAV) and joint action-value (JAV). We model future expectation cognition as random variables in FECM, which learn representation by maximizing mutual information with the future trajectory based on current information. Furthermore, a return-based regularizer is designed to reflect "expectation" and ensure informativeness in the future expectation representation module (FERM) which encodes the future trajectory. Experiments on StarCraft II micromanagement tasks and Google Research Football show that QFuture achieves significant state-of-the-art performance. Demonstrative videos are available at \url{https://sites.google.com/view/qfuture}.
源URL	[http://ir.ia.ac.cn/handle/173211/58536]
专题	复杂系统认知与决策实验室_群体决策智能团队
推荐引用方式 GB/T 7714	Liu BY. QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning[J]. IEEE Transactions on Cognitive and Developmental Systems,2024:12.
APA	Liu BY.(2024).QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning.IEEE Transactions on Cognitive and Developmental Systems,12.
MLA	Liu BY."QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning".IEEE Transactions on Cognitive and Developmental Systems (2024):12.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。