Chinese Academy of Sciences Institutional Repositories Grid (CAS IR Grid)
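Institutes
Institute of Automation [12]
Academy of Mathematics and Systems Science [9]
Institute of Computing Technology [1]
Institute of Process Engineering [1]
Collection method
OAI harvesting [23]
Content type
Journal articles [22]
Conference papers [1]
Publication date
2023 [2]
2022 [1]
2021 [1]
2020 [1]
2019 [1]
2017 [3]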
Browse/search results:
23 records in total, showing 1-10
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming
Journal article
OAI harvested
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 13
Authors: Wei, Qinglai; Zhou, Tianmin; Lu, Jingwei; Liu, Yu; Su, Shuai
Views/Downloads: 12/0  |  Submitted: 2023/11/17
Adaptive dynamic programming (ADP)
Hamilton-Jacobi-Bellman equation (HJBE)
nonlinear stochastic system
stochastic policy iteration (PI)
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control
Journal article
OAI harvested
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 9, Pages: 1797-1809
Authors: Ding Wang; Jiangyu Wang; Mingming Zhao; Peng Xin; Junfei Qiao
Views/Downloads: 15/0  |  Submitted: 2023/08/10
Adaptive critic
artificial neural networks
Hamilton-Jacobi-Bellman (HJB) equation
multi-step heuristic dynamic programming
multi-step reinforcement learning
optimal control
Optimal Synchronization Control of Heterogeneous Asymmetric Input-Constrained Unknown Nonlinear MASs via Reinforcement Learning
Journal article
OAI harvested
IEEE/CAA Journal of Automatica Sinica, 2022, Volume: 9, Issue: 3, Pages: 520-532
Authors: Lina Xia; Qing Li; Ruizhuo Song; Hamidreza Modares
Views/Downloads: 22/0  |  Submitted: 2022/03/09
Asymmetric input-constrained
heterogeneous nonlinear multiagent systems (MASs)
Hamilton-Jacobi-Bellman (HJB) equation
novel observer
reinforcement learning (RL)
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games
Conference paper
OAI harvested
Shenzhen, China, 05-09 July 2021
Authors: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁)
Views/Downloads: 22/0  |  Submitted: 2022/06/27
Video Game
Reinforcement Learning
Quantile Regression
Bellman residual
Wasserstein Distance
Maximum principles and the method of moving planes for the uniformly elliptic nonlocal Bellman operator and applications
Journal article
OAI harvested
ANNALI DI MATEMATICA PURA ED APPLICATA, 2020, Pages: 50
Authors: Dai, Wei; Qin, Guolin
Views/Downloads: 21/0  |  Submitted: 2020/09/23
Uniformly elliptic nonlocal Bellman operator
Uniformly elliptic nonlocal Monge-Ampere operator
Maximum principles
Method of moving planes
Monotonicity, symmetry and uniqueness
Asymptotic properties
Output Tracking Control Based on Adaptive Dynamic Programming With Multistep Policy Evaluation
Journal article
OAI harvested
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2019, Volume: 49, Issue: 10, Pages: 2155-2165
Authors: Luo, Biao; Liu, Derong; Huang, Tingwen; Liu, Jiangjiang
Views/Downloads: 69/0  |  Submitted: 2019/12/16
Adaptive dynamic programming (ADP)
Bellman equation
heuristic dynamic programming
neural networks (NNs)
output tracking control
Adaptive tracking control for a class of continuous-time uncertain nonlinear systems using the approximate solution of HJB equation
Journal article
OAI harvested
NEUROCOMPUTING, 2017, Volume: 260, Pages: 432-442
Authors: Mu, Chaoxu; Sun, Changyin; Wang, Ding; Song, Aiguo
Views/Downloads: 24/0  |  Submitted: 2017/09/12
Adaptive Tracking Control
Hamilton-Jacobi-Bellman (HJB) Equation
Adaptive Dynamic Programming (ADP)
Neural Networks
Uncertainties
Numerical Solution to Optimal Feedback Control by Dynamic Programming Approach: A Local Approximation Algorithm
Journal article
OAI harvested
JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2017, Volume: 30, Issue: 4, Pages: 782-802
Authors: Guo Bao-Zhu; Wu Tao-Tao
Views/Downloads: 23/0  |  Submitted: 2018/07/30
Curse of dimensionality
Hamilton-Jacobi-Bellman equation
optimal feedback control
upwind finite difference
viscosity solutions
Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming
Journal article
OAI harvested
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2017, Volume: 64, Issue: 5, Pages: 4101-4109
Authors: Zhu, Yuanheng; Zhao, Dongbin; He, Haibo; Ji, Junhong
Views/Downloads: 29/0  |  Submitted: 2017/09/12
Actor-critic-identifier
Concurrent Learning
Constrained Input
Event-triggered (ET) Control
Hamilton-Jacobi-Bellman (HJB) Equation
Equilibrium Dividend Strategy with Non-exponential Discounting in a Dual Model
Journal article
OAI harvested
JOURNAL OF OPTIMIZATION THEORY AND APPLICATIONS, 2016, Volume: 168, Issue: 2, Pages: 699-722
Authors: Li, Yongwu; Li, Zhongfei; Zeng, Yan
Views/Downloads: 22/0  |  Submitted: 2018/07/30
Non-exponential discount function
Equilibrium strategy
Dividend payment
Dual model
Hamilton-Jacobi-Bellman equation