中国科学院机构知识库网格系统: Enhancing Reinforcement Learning via Transformer-based State Predictive Representations

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Enhancing Reinforcement Learning via Transformer-based State Predictive Representations

文献类型：期刊论文


作者	Liu MS(刘民颂); Zhu YH(朱圆恒); Chen YR(陈亚冉); Zhao DB(赵冬斌)
刊名	IEEE Transactions on Artificial Intelligence
出版日期	2024-03
页码	1 - 12
英文摘要	Enhancing state representations can effectively mitigate the issue of low sample efficiency in reinforcement learning (RL) within high-dimensional input environments. Existing methods attempt to improve sample efficiency by learning predictive state representations from sequence data. However, there still remain significant challenges in achieving a comprehensive understanding and learning of information within long sequences. Motivated by this, we introduce a transformer-based state predictive representations auxiliary task that promotes better representation learning through self-supervised goals. Specifically, we design a transformer-based predictive model to establish unidirectional and bidirectional prediction tasks for predicting state representations within the latent space. TSPR effectively exploits contextual information within sequences to learn more informative state representations, thereby contributing to the enhancement of policy training in RL. Extensive experiments demonstrate that the combination of TSPR with off-policy RL algorithms leads to a substantial improvement in the sample efficiency of RL. Furthermore, TSPR outperforms state-of-the-art sample-efficient RL methods on both the multiple continuous control (DMControl) and discrete control (Atari) tasks.
源URL	[http://ir.ia.ac.cn/handle/173211/57520]
专题	复杂系统管理与控制国家重点实验室_深度强化学习
作者单位	中国科学院自动化研究所
推荐引用方式 GB/T 7714	Liu MS,Zhu YH,Chen YR,et al. Enhancing Reinforcement Learning via Transformer-based State Predictive Representations[J]. IEEE Transactions on Artificial Intelligence,2024:1 - 12.
APA	Liu MS,Zhu YH,Chen YR,&Zhao DB.(2024).Enhancing Reinforcement Learning via Transformer-based State Predictive Representations.IEEE Transactions on Artificial Intelligence,1 - 12.
MLA	Liu MS,et al."Enhancing Reinforcement Learning via Transformer-based State Predictive Representations".IEEE Transactions on Artificial Intelligence (2024):1 - 12.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。