中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [16]
数学与系统科学研究院 [1]
采集方式
OAI收割 [17]
内容类型
期刊论文 [17]
发表日期
2024 [1]
2023 [1]
2022 [1]
2021 [3]
2018 [3]
2017 [4]
更多
学科主题
筛选
浏览/检索结果:
共17条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
提交时间升序
提交时间降序
发表日期升序
发表日期降序
题名升序
题名降序
作者升序
作者降序
Adaptive Optimal Discrete-Time Output-Feedback Using an Internal Model Principle and Adaptive Dynamic Programming
期刊论文
OAI收割
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 131-140
作者:
Zhongyang Wang
;
Youqing Wang
;
Zdzisław Kowalczuk
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2024/01/02
Adaptive dynamic programming (ADP)
internal model principle (IMP)
output feedback problem
policy iteration (PI)
value iteration (VI)
Continuous-Time Stochastic Policy Iteration of Adaptive Dynamic Programming
期刊论文
OAI收割
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 13
作者:
Wei, Qinglai
;
Zhou, Tianmin
;
Lu, Jingwei
;
Liu, Yu
;
Su, Shuai
  |  
收藏
  |  
浏览/下载:4/0
  |  
提交时间:2023/11/17
Adaptive dynamic programming (ADP)
Hamilton-Jacobi-Bellman equation (HJBE)
nonlinear stochastic system
stochastic policy iteration (PI)
Model-Free Reinforcement Learning by Embedding an Auxiliary System for Optimal Control of Nonlinear Systems
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 4, 页码: 1520-1534
作者:
Xu, Zhenhui
;
Shen, Tielong
;
Cheng, Daizhan
  |  
收藏
  |  
浏览/下载:25/0
  |  
提交时间:2022/06/21
Mathematical model
Trajectory
Heuristic algorithms
Optimal control
System dynamics
Artificial neural networks
Convergence
Approximate optimal control design
auxiliary trajectory
completely model-free
integral reinforcement learning (IRL)
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors
期刊论文
OAI收割
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
;
He, Haibo
  |  
收藏
  |  
浏览/下载:19/0
  |  
提交时间:2021/08/15
Microscopy
Feedback control
Mathematical model
Data models
Dynamic programming
Psychology
Computational modeling
Adaptive dynamic programming (ADP)
heterogeneous corridors
macroscopic pedestrian dynamics
optimal feedback control
pedestrian flow
Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics
期刊论文
OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 6, 页码: 2929-2943
作者:
Song, Ruizhuo
;
Wei, Qinglai
;
Zhang, Huaguang
;
Lewis, Frank L.
  |  
收藏
  |  
浏览/下载:17/0
  |  
提交时间:2021/08/15
Adaptive critic designs
adaptive dynamic programming
approximate dynamic programming
discrete-time
nonzero-sum (NZS)
off-policy
reinforcement learning (RL)
Multiagent Reinforcement Learning:Rollout and Policy Iteration
期刊论文
OAI收割
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 2, 页码: 249-272
作者:
Dimitri Bertsekas
  |  
收藏
  |  
浏览/下载:22/0
  |  
提交时间:2021/04/09
Dynamic programming
multiagent problems
neuro-dynamic programming
policy iteration
reinforcement learning, rollout
Policy Iteration Algorithm Based Fault Tolerant Tracking Control: An Implementation on Reconfigurable Manipulators
期刊论文
OAI收割
JOURNAL OF ELECTRICAL ENGINEERING & TECHNOLOGY, 2018, 卷号: 13, 期号: 4, 页码: 1739-1750
作者:
Li, Yuanchun
;
Xia, Hongbing
;
Zhao, Bo
  |  
收藏
  |  
浏览/下载:38/0
  |  
提交时间:2018/10/10
Adaptive dynamic programming
Policy iteration
Fault tolerant tracking control
Reconfigurable manipulators
Neural network
Data-Based Optimal Control for Weakly Coupled Nonlinear Systems Using Policy Iteration
期刊论文
OAI收割
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 卷号: 48, 期号: 4, 页码: 511-521
作者:
Li, Chao
;
Liu, Derong
;
Wang, Ding
  |  
收藏
  |  
浏览/下载:20/0
  |  
提交时间:2017/05/03
Adaptive Dynamic Programming (Adp)
Neural Networks (Nns)
Optimal Control
Policy Iteration (Pi)
Unknown Dynamics
Weakly Coupled Systems
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming
期刊论文
OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2018, 卷号: 48, 期号: 2, 页码: 500-509
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
;
Yang, Xiong
;
Zhang, Qichao
  |  
收藏
  |  
浏览/下载:23/0
  |  
提交时间:2018/10/10
Adaptive Dynamic Programming (Adp)
h Infinity Optimal Control
Policy Iteration (Pi)
Polynomial Nonlinear Systems
Sum Of Squares (Sos)
Neural-network-based synchronous iteration learning method for multi-player zero-sum games
期刊论文
OAI收割
NEUROCOMPUTING, 2017, 卷号: 242, 页码: 73-82
作者:
Song, Ruizhuo
;
Wei, Qinglai
;
Song, Biao
  |  
收藏
  |  
浏览/下载:12/0
  |  
提交时间:2017/09/12
Adaptive Dynamic Programming
Approximate Dynamic Programming
Adaptive Critic Designs
Multi-player
Iteration Learning
Neural Network