中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Institution: Institute of Automation [11]
Collection method: OAI harvesting [11]
Content type: Journal article [11]
Publication date (partial list): 2024 [2]; 2021 [1]; 2019 [1]; 2018 [3]; 2017 [2]; 2015 [1]
Search results: 11 records in total, showing records 1-10
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees
Journal article | OAI harvesting
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 11
Authors: Zhu, Liao; Wei, Qinglai; Guo, Ping
Views/Downloads: 27/0 | Submitted: 2024/07/03
Approximate dynamic programming (ADP)
neural network
off-policy
optimal control
reinforcement learning (RL)
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms
Journal article | OAI harvesting
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors: Chen, Yurou; Zhang, Fengyi; Liu, Zhiyong
Views/Downloads: 33/0 | Submitted: 2024/02/22
Reinforcement Learning
Policy gradient
Actor-critic
Value function
Bias-variance trade-off
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics
Journal article | OAI harvesting
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Volume: 51, Issue: 6, Pages: 2929-2943
Authors: Song, Ruizhuo; Wei, Qinglai; Zhang, Huaguang; Lewis, Frank L.
Views/Downloads: 29/0 | Submitted: 2021/08/15
Adaptive critic designs
adaptive dynamic programming
approximate dynamic programming
discrete-time
nonzero-sum (NZS)
off-policy
reinforcement learning (RL)
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics
Journal article | OAI harvesting
IEEE TRANSACTIONS ON CYBERNETICS, 2019, Volume: 49, Issue: 8, Pages: 2874-2885
Authors: Zhang, Qichao; Zhao, Dongbin
Views/Downloads: 107/0 | Submitted: 2019/07/12
Integral reinforcement learning (IRL)
neural network (NN)
nonzero-sum (NZS) games
off-policy
single-critic
unknown drift dynamics
An off-policy iteration algorithm for robust stabilization of constrained-input uncertain nonlinear systems
Journal article | OAI harvesting
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, Volume: 28, Issue: 18, Pages: 5747-5765
Authors: Yang, Xiong; Wei, Qinglai
Views/Downloads: 40/0 | Submitted: 2019/01/08
constrained input
mismatched uncertainties
off-policy iteration
reinforcement learning
robust stabilization
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay
Journal article | OAI harvesting
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 12, Pages: 3337-3348
Authors: Luo, Biao; Yang, Yin; Liu, Derong
Views/Downloads: 55/0 | Submitted: 2019/01/08
Data-based
experience replay
neural networks (NNs)
off-policy
optimal control
Q-learning (QL)
Comprehensive comparison of online ADP algorithms for continuous-time optimal control
Journal article | OAI harvesting
ARTIFICIAL INTELLIGENCE REVIEW, 2018, Volume: 49, Issue: 4, Pages: 531-547
Authors: Zhu, Yuanheng; Zhao, Dongbin
Views/Downloads: 21/0 | Submitted: 2017/09/13
Adaptive Dynamic Programming
Policy Iteration
Integral Reinforcement Learning
Experience Replay
Off-policy
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control
Journal article | OAI harvesting
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3341-3354
Authors: Luo, Biao; Liu, Derong; Wu, Huai-Ning; Wang, Ding; Lewis, Frank L.
Views/Downloads: 23/0 | Submitted: 2016/11/09
Adaptive Control
Adaptive Dynamic Programming (Adp)
Data-based
Off-policy Learning
Optimal Control
Policy Gradient
Off-Policy Integral Reinforcement Learning Method to Solve Nonlinear Continuous-Time Multiplayer Nonzero-Sum Games
Journal article | OAI harvesting
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2017, Volume: 28, Issue: 3, Pages: 704-713
Authors: Song, Ruizhuo; Lewis, Frank L.; Wei, Qinglai
Views/Downloads: 32/0 | Submitted: 2017/05/05
Adaptive Critic Designs
Adaptive Dynamic Programming (Adp)
Approximate Dynamic Programming
Integral Reinforcement Learning (Irl)
Nonlinear Systems
Nonzero Sum (Nzs)
Off-policy
Reinforcement learning solution for HJB equation arising in constrained optimal control problem
Journal article | OAI harvesting
NEURAL NETWORKS, 2015, Volume: 71, Pages: 150-158
Authors: Luo, Biao; Wu, Huai-Ning; Huang, Tingwen; Liu, Derong
Views/Downloads: 51/0 | Submitted: 2016/03/30
Constrained optimal control
Data-based
Off-policy reinforcement learning
Hamilton-Jacobi-Bellman equation
The method of weighted residuals