中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [5]
数学与系统科学研究院 [1]
采集方式
OAI收割 [6]
内容类型
期刊论文 [6]
发表日期
2022 [2]
2021 [1]
2014 [2]
2011 [1]
学科主题
筛选
浏览/检索结果:
共6条,第1-6条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
作者升序
作者降序
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
  |  
收藏
  |  
浏览/下载:38/0
  |  
提交时间:2022/06/10
Games
Nash equilibrium
Mathematical model
Markov processes
Convergence
Dynamic programming
Training
Deep reinforcement learning (DRL)
generalized policy iteration (GPI)
Markov game (MG)
Nash equilibrium
Q network
zero sum
Event-triggered optimal control for discrete-time multi-player non-zero-sum games using parallel control
期刊论文
OAI收割
INFORMATION SCIENCES, 2022, 卷号: 584, 页码: 519-535
作者:
Lu, Jingwei
;
Wei, Qinglai
;
Wang, Ziyang
;
Zhou, Tianmin
;
Wang, Fei-Yue
  |  
收藏
  |  
浏览/下载:35/0
  |  
提交时间:2021/12/28
Event-triggered
Non-zero-sum games
Parallel control
Neural network
Adaptive dynamic programming
A Novel Resilient Control Scheme for a Class of Markovian Jump Systems With Partially Unknown Information
期刊论文
OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 页码: 10
作者:
Zhang, Kun
;
Su, Rong
;
Zhang, Huaguang
  |  
收藏
  |  
浏览/下载:40/0
  |  
提交时间:2022/04/02
Games
Process control
Markov processes
Game theory
Actuators
System dynamics
Heuristic algorithms
Adaptive dynamic programming
integral reinforcement learning (IRL)
resilient control
zero-sum game
Integral Reinforcement Learning for Linear Continuous-Time Zero-Sum Games With Completely Unknown Dynamics
期刊论文
OAI收割
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2014, 卷号: 11, 期号: 3, 页码: 706-714
作者:
Li, Hongliang
;
Liu, Derong
;
Wang, Ding
收藏
  |  
浏览/下载:52/0
  |  
提交时间:2015/08/12
Adaptive critic designs
adaptive dynamic programming
approximate dynamic programming
reinforcement learning
policy iteration
zero-sum games
Multiperson zero-sum differential games for a class of uncertain nonlinear systems
期刊论文
OAI收割
INTERNATIONAL JOURNAL OF ADAPTIVE CONTROL AND SIGNAL PROCESSING, 2014, 卷号: 28, 期号: 3-5, 页码: 205-231
作者:
Liu, Derong
;
Wei, Qinglai
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2015/08/12
uncertain nonlinear systems
neural networks
multiperson zero-sum differential games
adaptive dynamic programming
An iterative adaptive dynamic programming method for solving a class of nonlinear zero-sum differential games
期刊论文
OAI收割
AUTOMATICA, 2011, 卷号: 47, 期号: 1, 页码: 207-214
作者:
Zhang, Huaguang
;
Wei, Qinglai
;
Liu, Derong
收藏
  |  
浏览/下载:22/0
  |  
提交时间:2015/08/12
Adaptive critic designs
Adaptive dynamic programming
Approximate dynamic programming
Neural network
Zero-sum differential games