中国科学院机构知识库网格系统: Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning

Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning

文献类型：期刊论文


作者	Yang, Xiong; Liu, Derong; Wang, Ding; Wei, Qinglai
刊名	NEURAL NETWORKS
出版日期	2014-07-01
卷号	55 页码:30-41
关键词	Adaptive critic design Neural network Nonaffine nonlinear system Online learning Reinforcement learning
英文摘要	In this paper, a reinforcement-learning-based direct adaptive control is developed to deliver a desired tracking performance for a class of discrete-time (DT) nonlinear systems with unknown bounded disturbances. We investigate multi-input-multi-output unknown nonaffine nonlinear DT systems and employ two neural networks (NNs). By using Implicit Function Theorem, an action NN is used to generate the control signal and it is also designed to cancel the nonlinearity of unknown DT systems, for purpose of utilizing feedback linearization methods. On the other hand, a critic NN is applied to estimate the cost function, which satisfies the recursive equations derived from heuristic dynamic programming. The weights of both the action NN and the critic NN are directly updated online instead of offline training. By utilizing Lyapunov's direct method, the closed-loop tracking errors and the NN estimated weights are demonstrated to be uniformly ultimately bounded. Two numerical examples are provided to show the effectiveness of the present approach. (C) 2014 Elsevier Ltd. All rights reserved.
WOS标题词	Science & Technology ; Technology ; Life Sciences & Biomedicine
类目[WOS]	Computer Science, Artificial Intelligence ; Neurosciences
研究领域[WOS]	Computer Science ; Neurosciences & Neurology
关键词[WOS]	NEURAL-NETWORK CONTROL ; OUTPUT-FEEDBACK CONTROL ; ADAPTIVE-CONTROL ; CONTROL SCHEME ; DESIGN ; ERROR ; APPROXIMATION ; ARCHITECTURE ; STATE ; NET
收录类别	SCI
语种	英语
WOS记录号	WOS:000337860600004
源URL	[http://ir.ia.ac.cn/handle/173211/3864]
专题	自动化研究所_复杂系统管理与控制国家重点实验室_智能化团队
作者单位	Chinese Acad Sci, Inst Automat, State Key Lab Management & Control Complex Syst, Beijing 100190, Peoples R China
推荐引用方式 GB/T 7714	Yang, Xiong,Liu, Derong,Wang, Ding,et al. Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning[J]. NEURAL NETWORKS,2014,55:30-41.
APA	Yang, Xiong,Liu, Derong,Wang, Ding,&Wei, Qinglai.(2014).Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning.NEURAL NETWORKS,55,30-41.
MLA	Yang, Xiong,et al."Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning".NEURAL NETWORKS 55(2014):30-41.

入库方式： OAI收割

来源：自动化研究所

下载0

Discrete-time online learning control for a class of unknown nonaffine nonlinear systems using reinforcement learning

其他版本