中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features

文献类型:期刊论文

作者Wenzhang Liu; Lu Dong; Dan Niu; Changyin Sun
刊名IEEE/CAA Journal of Automatica Sinica
出版日期2022
卷号9期号:9页码:1673-1686
关键词Knowledge transfer multi-agent systems reinforcement learning successor features
ISSN号2329-9266
DOI10.1109/JAS.2022.105809
英文摘要In multi-agent reinforcement learning (MARL), the behaviors of each agent can influence the learning of others, and the agents have to search in an exponentially enlarged joint-action space. Hence, it is challenging for the multi-agent teams to explore in the environment. Agents may achieve suboptimal policies and fail to solve some complex tasks. To improve the exploring efficiency as well as the performance of MARL tasks, in this paper, we propose a new approach by transferring the knowledge across tasks. Differently from the traditional MARL algorithms, we first assume that the reward functions can be computed by linear combinations of a shared feature function and a set of task-specific weights. Then, we define a set of basic MARL tasks in the source domain and pre-train them as the basic knowledge for further use. Finally, once the weights for target tasks are available, it will be easier to get a well-performed policy to explore in the target domain. Hence, the learning process of agents for target tasks is speeded up by taking full use of the basic knowledge that was learned previously. We evaluate the proposed algorithm on two challenging MARL tasks: cooperative box-pushing and non-monotonic predator-prey. The experiment results have demonstrated the improved performance compared with state-of-the-art MARL algorithms.
源URL[http://ir.ia.ac.cn/handle/173211/49681]  
专题自动化研究所_学术期刊_IEEE/CAA Journal of Automatica Sinica
推荐引用方式
GB/T 7714
Wenzhang Liu,Lu Dong,Dan Niu,et al. Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features[J]. IEEE/CAA Journal of Automatica Sinica,2022,9(9):1673-1686.
APA Wenzhang Liu,Lu Dong,Dan Niu,&Changyin Sun.(2022).Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features.IEEE/CAA Journal of Automatica Sinica,9(9),1673-1686.
MLA Wenzhang Liu,et al."Efficient Exploration for Multi-Agent Reinforcement Learning via Transferable Successor Features".IEEE/CAA Journal of Automatica Sinica 9.9(2022):1673-1686.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。