中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Browse/search results: 8 records in total, showing records 1-8
Online Off-Policy Reinforcement Learning for Optimal Control of Unknown Nonlinear Systems Using Neural Networks
Journal article
OAI harvested
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, Pages: 11
Authors: Zhu, Liao; Wei, Qinglai; Guo, Ping
Views/Downloads: 19/0 | Submitted: 2024/07/03
Adaptive dynamic programming
nonlinear systems
online learning
optimal control
reinforcement learning (RL)
Online identifier-actor-critic algorithm for optimal control of nonlinear systems
Journal article
OAI harvested
OPTIMAL CONTROL APPLICATIONS & METHODS, 2017, Volume: 38, Issue: 3, Pages: 317-335
Authors: Lin, Hanquan; Wei, Qinglai; Liu, Derong
Views/Downloads: 25/0 | Submitted: 2017/07/18
Adaptive Dynamic Programming
Optimal Control
Discrete-time
Nonlinear System
Neural Network
Online Learning
Lyapunov Method
Online reinforcement learning control by Bayesian inference
Journal article
OAI harvested
IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1331-1338
Authors: Xia, Zhongpu; Zhao, Dongbin
Views/Downloads: 67/0 | Submitted: 2016/06/15
Learning Systems
Bayes Methods
Gaussian Processes
Optimal Control
Online Reinforcement Learning Control
Bayesian Inference
Self-learning Control
Probability
Action Value Function
Gaussian Process
Bayesian-state-action-reward-state-action Algorithm
A neural-network-based online optimal control approach for nonlinear robust decentralized stabilization
Journal article
OAI harvested
SOFT COMPUTING, 2016, Volume: 20, Issue: 2, Pages: 707-716
Authors: Wang, Ding; Liu, Derong; Li, Hongliang; Ma, Hongwen; Li, Chao
Views/Downloads: 17/0 | Submitted: 2016/06/14
Adaptive Dynamic Programming
Approximate Dynamic Programming
Neural Networks
Online Optimal Control
Robust Decentralized Stabilization
Uncertain Nonlinear Systems
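Near-Optimal Online Reinforcement Learning for Continuous-State Systems
Doctoral dissertation
OAI harvested
Doctor of Engineering, Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2015
Author: Zhu, Yuanheng (朱圆恒)
Views/Downloads: 184/0 | Submitted: 2015/09/02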
Reinforcement learning
optimal control
approximate policy iteration
probably approximately correct
continuous-state system
convergence
online learning
kd-tree
Budget Planning for Coupled Campaigns in Sponsored Search Auctions
Journal article
OAI harvested
INTERNATIONAL JOURNAL OF ELECTRONIC COMMERCE, 2014, Volume: 18, Issue: 3, Pages: 39-65
Authors: Yang, Yanwu; Qin, Rui; Jansen, Bernard J.; Zhang, Jie; Zeng, Daniel
Views/Downloads: 33/0 | Submitted: 2015/08/12
Advertising campaigns
budget planning decision analysis
online advertising
operations research in marketing
optimal control
sponsored search
sponsored search auctions
Adaptive optimal control for a class of continuous-time affine nonlinear systems with unknown internal dynamics
Journal article
OAI harvested
NEURAL COMPUTING & APPLICATIONS, 2013, Volume: 23, Issue: 7-8, Pages: 1843-1850
Authors: Liu, Derong; Yang, Xiong; Li, Hongliang
Views/Downloads: 35/0 | Submitted: 2015/08/12
Adaptive dynamic programming
Reinforcement learning
Policy iteration
Adaptive optimal control
Neural network
Online control
Nonlinear system
Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints
Journal article
OAI harvested
IET CONTROL THEORY AND APPLICATIONS, 2013, Volume: 7, Issue: 17, Pages: 2037-2047
Authors: Yang, Xiong; Liu, Derong; Huang, Yuzhu
Views/Downloads: 32/0 | Submitted: 2015/08/12
adaptive control
approximation theory
closed loop systems
continuous time systems
Lyapunov methods
neurocontrollers
nonlinear control systems
optimal control
robust control
uncertain systems
neural network-based online adaptive optimal control
uncertain nonlinear continuous-time systems
control constraints
infinite-horizon optimal control problem
control policy
saturation constraints
identifier-critic architecture
Hamilton-Jacobi-Bellman equation approximation
uncertain system dynamics
critic NN
action-critic dual networks
reinforcement learning
identifier NN
policy iteration
Lyapunov's direct method
closed loop system stability