消息
×
loading..
中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [26]
数学与系统科学研究院 [1]
沈阳自动化研究所 [1]
西安光学精密机械研究... [1]
采集方式
OAI收割 [29]
_filter
_filter
_filter
筛选
浏览/检索结果:
共29条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
作者升序
作者降序
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
Enhancing Iterative Learning Control With Fractional Power Update Law
期刊论文
OAI收割
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 5, 页码: 1137-1149
作者:
Zihan Li
;
Dong Shen
;
Xinghuo Yu
|
收藏
|
浏览/下载:16/0
|
提交时间:2023/04/26
Asymptotic convergence
convergence rate
finite-iteration tracking
fractional power learning rule
limit cycles
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
|
收藏
|
浏览/下载:35/0
|
提交时间:2022/06/10
Games
Nash equilibrium
Mathematical model
Markov processes
Convergence
Dynamic programming
Training
Deep reinforcement learning (DRL)
generalized policy iteration (GPI)
Markov game (MG)
Nash equilibrium
Q network
zero sum
Discounted Iterative Adaptive Critic Designs With Novel Stability Analysis for Tracking Control
期刊论文
OAI收割
IEEE/CAA Journal of Automatica Sinica, 2022, 卷号: 9, 期号: 7, 页码: 1262-1272
作者:
Mingming Ha
;
Ding Wang
;
Derong Liu
|
收藏
|
浏览/下载:76/0
|
提交时间:2022/06/27
Adaptive critic design
adaptive dynamic programming (ADP)
approximate dynamic programming
discrete-time nonlinear systems
reinforcement learning
stability analysis
tracking control
value iteration (VI)
A New Integral Critic Learning for Optimal Tracking Control with Applications to Boiler-Turbine Systems
期刊论文
OAI收割
OPTIMAL CONTROL APPLICATIONS & METHODS, 2021, 页码: 16
作者:
Wei, Qinglai
;
Liu, Yujia
;
Lu, Jingwei
;
Ling, Jun
;
Luan, Zhenhua
|
收藏
|
浏览/下载:42/0
|
提交时间:2021/12/28
adaptive dynamic programming
boiler-turbine system
integral reinforcement learning
neural network
policy iteration
Multiagent Reinforcement Learning:Rollout and Policy Iteration
期刊论文
OAI收割
IEEE/CAA Journal of Automatica Sinica, 2021, 卷号: 8, 期号: 2, 页码: 249-272
作者:
Dimitri Bertsekas
|
收藏
|
浏览/下载:31/0
|
提交时间:2021/04/09
Dynamic programming
multiagent problems
neuro-dynamic programming
policy iteration
reinforcement learning, rollout
Controller Optimization for Multirate Systems Based on Reinforcement Learning
期刊论文
OAI收割
International Journal of Automation and Computing, 2020, 卷号: 17, 期号: 3, 页码: 417-427
作者:
Zhan Li
;
Sheng-Ri Xue
;
Xing-Hu Yu
;
Hui-Jun Gao
|
收藏
|
浏览/下载:13/0
|
提交时间:2021/02/22
Multirate system
reinforcement learning
policy iteration
optimal control
controller optimization.
Short-term load forecasting of long-short term memory neural network based on genetic algorithm
会议论文
OAI收割
Wuhan, China, October 30 - November 1, 2020
作者:
Li WT(李婉婷)
;
Zang CZ(臧传治)
;
Liu D(刘鼎)
;
Zeng P(曾鹏)
|
收藏
|
浏览/下载:21/0
|
提交时间:2021/03/14
load forecasting
long-short term neural networks
genetic algorithm
learning rate
iteration number
An off-policy iteration algorithm for robust stabilization of constrained-input uncertain nonlinear systems
期刊论文
OAI收割
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, 卷号: 28, 期号: 18, 页码: 5747-5765
作者:
Yang, Xiong
;
Wei, Qinglai
|
收藏
|
浏览/下载:40/0
|
提交时间:2019/01/08
constrained input
mismatched uncertainties
off-policy iteration
reinforcement learning
robust stabilization
Comprehensive comparison of online ADP algorithms for continuous-time optimal control
期刊论文
OAI收割
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
|
收藏
|
浏览/下载:22/0
|
提交时间:2017/09/13
Adaptive Dynamic Programming
Policy Iteration
Integral Reinforcement Learning
Experience Replay
Off-policy
Manifold Regularized Reinforcement Learning
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 4, 页码: 932-943
作者:
Li, Hongliang
;
Liu, Derong
;
Wang, Ding
|
收藏
|
浏览/下载:40/0
|
提交时间:2018/10/10
Adaptive Dynamic Programming
Approximate Dynamic Programming
Approximate Policy Iteration (Api)
Manifold Regularization
Reinforcement Learning (Rl)
首页
上一页
1
2
3
下一页
末页