中国科学院机构知识库网格系统: Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language.

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language.

文献类型：会议论文


作者	Zhourui Guo1,2 ; Meng Yao 2; Yang Yu1,2 ; Qiyue Yin1,2
出版日期	2023-09
会议日期	2023-12-9
会议地点	北京华腾美居酒店
英文摘要	Reinforcement learning is a powerful technique for learning from trial and error, but it often requires a large number of interactions to achieve good performance. In some domains, such as sparse-reward tasks, an oracle that can provide useful feedback or guidance to the agent during the learning process is really of great importance. However, querying the oracle too frequently may be costly or impractical, and the oracle may not always have a clear answer for every situation. Therefore, we propose a novel method for interacting with the oracle in a selective and efficient way, using a retrieval-based approach. We assume that the interaction can be modeled as a sequence of templated questions and answers, and that there is a large corpus of previous interactions available. We use a neural network to encode the current state of the agent and the oracle, and retrieve the most relevant question from the corpus to ask the oracle. We then use the oracle's answer to update the agent's policy and value function. We evaluate our method on an object manipulation task. We show that our method can significantly improve the efficiency of RL by reducing the number of interactions needed to reach a certain level of performance, compared to baselines that do not use the oracle or use it in a naive way.
源URL	[http://ir.ia.ac.cn/handle/173211/57133]
专题	智能系统与工程
通讯作者	Qiyue Yin
作者单位	1.中国科学院大学人工智能学院 2.中国科学院自动化研究所
推荐引用方式 GB/T 7714	Zhourui Guo,Meng Yao,Yang Yu,et al. Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language.[C]. 见:. 北京华腾美居酒店. 2023-12-9.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。