Enhancing Reinforcement Learning via Transformer-based State Predictive Representations
文献类型:期刊论文
作者 | Liu MS(刘民颂)![]() ![]() ![]() ![]() |
刊名 | IEEE Transactions on Artificial Intelligence
![]() |
出版日期 | 2024-03 |
页码 | 1 - 12 |
英文摘要 | Enhancing state representations can effectively mitigate the issue of low sample efficiency in reinforcement learning (RL) within high-dimensional input environments. Existing methods attempt to improve sample efficiency by learning predictive state representations from sequence data. However, there still remain significant challenges in achieving a comprehensive understanding and learning of information within long sequences. Motivated by this, we introduce a transformer-based state predictive representations auxiliary task that promotes better representation learning through self-supervised goals. Specifically, we design a transformer-based predictive model to establish unidirectional and bidirectional prediction tasks for predicting state representations within the latent space. TSPR effectively exploits contextual information within sequences to learn more informative state representations, thereby contributing to the enhancement of policy training in RL. Extensive experiments demonstrate that the combination of TSPR with off-policy RL algorithms leads to a substantial improvement in the sample efficiency of RL. Furthermore, TSPR outperforms state-of-the-art sample-efficient RL methods on both the multiple continuous control (DMControl) and discrete control (Atari) tasks. |
源URL | [http://ir.ia.ac.cn/handle/173211/57520] ![]() |
专题 | 复杂系统管理与控制国家重点实验室_深度强化学习 |
作者单位 | 中国科学院自动化研究所 |
推荐引用方式 GB/T 7714 | Liu MS,Zhu YH,Chen YR,et al. Enhancing Reinforcement Learning via Transformer-based State Predictive Representations[J]. IEEE Transactions on Artificial Intelligence,2024:1 - 12. |
APA | Liu MS,Zhu YH,Chen YR,&Zhao DB.(2024).Enhancing Reinforcement Learning via Transformer-based State Predictive Representations.IEEE Transactions on Artificial Intelligence,1 - 12. |
MLA | Liu MS,et al."Enhancing Reinforcement Learning via Transformer-based State Predictive Representations".IEEE Transactions on Artificial Intelligence (2024):1 - 12. |
入库方式: OAI收割
来源:自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。