中国科学院机构知识库网格系统: Online reinforcement learning control by Bayesian inference

Online reinforcement learning control by Bayesian inference

文献类型：期刊论文


作者	Xia, Zhongpu; Zhao, Dongbin; Dongbin Zhao
刊名	IET CONTROL THEORY AND APPLICATIONS
出版日期	2016-08-08
卷号	10 期号:12 页码:1331-1338
关键词	Learning Systems Bayes Methods Gaussian Processes Optimal Control Online Reinforcement Learning Control Bayesian Inference Self-learning Control Probability Action Value Function Gaussian Process Bayesian-state-action-reward-state-action Algorithm
DOI	10.1049/iet-cta.2015.0669
文献子类	Article
英文摘要	Reinforcement learning offers a promising way for self-learning control of an unknown system, but it involves the issues of policy evaluation and exploration, especially in the domain of continuous state. In this study, these issues are addressed from the perspective of probability. It models the action value function as the latent variable of Gaussian process, while the reward as the observed variable. Then an online approach is proposed to update the action value function by Bayesian inference. Taking an advantage of the proposed framework, a prior knowledge can be incorporated into the action value function, and thus an efficient exploration strategy is presented. At last, the Bayesian-state-action-reward-state-action algorithm is tested on some benchmark problems and empirical results show its effectiveness.
WOS关键词	AFFINE NONLINEAR-SYSTEMS ; FEEDBACK-CONTROL ; TIME-SYSTEMS ; ALGORITHM ; ITERATION
WOS研究方向	Automation & Control Systems ; Engineering ; Instruments & Instrumentation
语种	英语
WOS记录号	WOS:000381410000003
资助机构	National Natural Science Foundation of China (NSFC)(61273136 ; 61573353 ; 61533017)
源URL	[http://ir.ia.ac.cn/handle/173211/11432]
专题	复杂系统管理与控制国家重点实验室_深度强化学习
通讯作者	Dongbin Zhao
作者单位	Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
推荐引用方式 GB/T 7714	Xia, Zhongpu,Zhao, Dongbin,Dongbin Zhao. Online reinforcement learning control by Bayesian inference[J]. IET CONTROL THEORY AND APPLICATIONS,2016,10(12):1331-1338.
APA	Xia, Zhongpu,Zhao, Dongbin,&Dongbin Zhao.(2016).Online reinforcement learning control by Bayesian inference.IET CONTROL THEORY AND APPLICATIONS,10(12),1331-1338.
MLA	Xia, Zhongpu,et al."Online reinforcement learning control by Bayesian inference".IET CONTROL THEORY AND APPLICATIONS 10.12(2016):1331-1338.

入库方式： OAI收割

来源：自动化研究所

下载0

Online reinforcement learning control by Bayesian inference

其他版本