中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid

Browse/search results: 17 records in total, showing 1-10

Discrete-Time Non-Zero-Sum Games With Completely Unknown Dynamics (journal article, OAI harvested)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, vol. 51, no. 6, pp. 2929-2943
Authors: Song, Ruizhuo;  Wei, Qinglai;  Zhang, Huaguang;  Lewis, Frank L.
Views/downloads: 16/0  |  Submitted: 2021/08/15
Multiagent Reinforcement Learning: Rollout and Policy Iteration (journal article, OAI harvested)
IEEE/CAA Journal of Automatica Sinica, 2021, vol. 8, no. 2, pp. 249-272
Authors: Dimitri Bertsekas
Views/downloads: 21/0  |  Submitted: 2021/04/09
Controller Optimization for Multirate Systems Based on Reinforcement Learning (journal article, OAI harvested)
International Journal of Automation and Computing, 2020, vol. 17, no. 3, pp. 417-427
Authors: Zhan Li;  Sheng-Ri Xue;  Xing-Hu Yu;  Hui-Jun Gao
Views/downloads: 9/0  |  Submitted: 2021/02/22
Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics (journal article, OAI harvested)
IEEE TRANSACTIONS ON CYBERNETICS, 2019, vol. 49, no. 8, pp. 2874-2885
Authors: Zhang, Qichao;  Zhao, Dongbin
Views/downloads: 92/0  |  Submitted: 2019/07/12
An off-policy iteration algorithm for robust stabilization of constrained-input uncertain nonlinear systems (journal article, OAI harvested)
INTERNATIONAL JOURNAL OF ROBUST AND NONLINEAR CONTROL, 2018, vol. 28, no. 18, pp. 5747-5765
Authors: Yang, Xiong;  Wei, Qinglai
Views/downloads: 31/0  |  Submitted: 2019/01/08
Adaptive Q-Learning for Data-Based Optimal Output Regulation With Experience Replay (journal article, OAI harvested)
IEEE TRANSACTIONS ON CYBERNETICS, 2018, vol. 48, no. 12, pp. 3337-3348
Authors: Luo, Biao;  Yang, Yin;  Liu, Derong
Views/downloads: 49/0  |  Submitted: 2019/01/08
Comprehensive comparison of online ADP algorithms for continuous-time optimal control (journal article, OAI harvested)
ARTIFICIAL INTELLIGENCE REVIEW, 2018, vol. 49, no. 4, pp. 531-547
Authors: Zhu, Yuanheng;  Zhao, Dongbin
Views/downloads: 15/0  |  Submitted: 2017/09/13
Neural-network-based synchronous iteration learning method for multi-player zero-sum games (journal article, OAI harvested)
NEUROCOMPUTING, 2017, vol. 242, pp. 73-82
Authors: Song, Ruizhuo;  Wei, Qinglai;  Song, Biao
Views/downloads: 12/0  |  Submitted: 2017/09/12
Off-policy neuro-optimal control for unknown complex-valued nonlinear systems based on policy iteration (journal article, OAI harvested)
NEURAL COMPUTING & APPLICATIONS, 2017, vol. 28, no. 6, pp. 1435-1441
Authors: Song, Ruizhuo;  Wei, Qinglai;  Xiao, Wendong
Views/downloads: 17/0  |  Submitted: 2017/02/23
Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs (journal article, OAI harvested)
NEUROCOMPUTING, 2017, vol. 238, pp. 377-386
Authors: Zhang, Qichao;  Zhao, Dongbin;  Zhu, Yuanheng
Views/downloads: 9/0  |  Submitted: 2017/05/04