Centralized Cooperative Exploration Policy for Continuous Control Tasks
文献类型:会议论文
作者 | Chao Li1![]() ![]() ![]() ![]() ![]() |
出版日期 | 2023-05 |
会议日期 | May 29–June 2, 2023 |
会议地点 | London, United Kingdom |
关键词 | continuous control tasks cooperative exploration |
DOI | 10.5555/3545946.3598965 |
页码 | 2454–2456 |
英文摘要 | Despite recent works making great progress in continuous control tasks, exploration in these tasks has remained insufficiently investigated. This paper proposes CCEP (C entralized C ooperative E xploration P olicy), which utilizes estimation biases of value functions to contribute to the exploration capacity. CCEP keeps two value functions initialized with different parameters, and generates diverse policies with multiple exploration styles from a pair of value functions. In addition, a centralized policy framework ensures that CCEP achieves message delivery between multiple policies, furthermore contributing to exploring the environment cooperatively. Extensive experimental results demonstrate that CCEP achieves higher exploration capacity. Empirical analysis shows diverse exploration styles in the learned policies by CCEP, reaping benefits in more exploration regions. Besides, the exploration capabilities of CCEP have been demonstrated to outperform current state-of-the-art methods on multiple continuous control tasks. |
会议录 | Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems
![]() |
会议录出版者 | International Foundation for Autonomous Agents and Multiagent Systems |
会议录出版地 | Richland, SC |
语种 | 英语 |
源URL | [http://ir.ia.ac.cn/handle/173211/56696] ![]() |
专题 | 自动化研究所_复杂系统管理与控制国家重点实验室_机器人应用与理论组 |
通讯作者 | Xinwen Hou; Yu Liu |
作者单位 | 1.Institute of Automation, Chinese Academy of Sciences, Beijing, China 2.University of Tubingen, Tubingen, Germany |
推荐引用方式 GB/T 7714 | Chao Li,Chen Gong,Qiang He,et al. Centralized Cooperative Exploration Policy for Continuous Control Tasks[C]. 见:. London, United Kingdom. May 29–June 2, 2023. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。