中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [4]
采集方式
OAI收割 [4]
内容类型
期刊论文 [4]
发表日期
2022 [1]
2021 [1]
2020 [1]
2017 [1]
学科主题
筛选
浏览/检索结果:
共4条,第1-4条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
发表日期升序
发表日期降序
提交时间升序
提交时间降序
作者升序
作者降序
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
  |  
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2022/06/10
Games
Nash equilibrium
Mathematical model
Markov processes
Convergence
Dynamic programming
Training
Deep reinforcement learning (DRL)
generalized policy iteration (GPI)
Markov game (MG)
Nash equilibrium
Q network
zero sum
Multiagent Adversarial Collaborative Learning via Mean-Field Theory
期刊论文
OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 卷号: 51, 期号: 10, 页码: 4994-5007
作者:
Luo, Guiyang
;
Zhang, Hui
;
He, Haibo
;
Li, Jinglin
;
Wang, Fei-Yue
  |  
收藏
  |  
浏览/下载:37/0
  |  
提交时间:2021/12/28
Games
Training
Collaborative work
Task analysis
Nash equilibrium
Sociology
Statistics
Adversarial collaborative learning (ACL)
friend-or-foe Q-learning
mean-field theory
multiagent reinforcement learning (MARL)
Nash Q-learning based equilibrium transfer for integrated energy management game with We-Energy
期刊论文
OAI收割
NEUROCOMPUTING, 2020, 卷号: 396, 页码: 216-223
作者:
Yang, Lingxiao
;
Sun, Qiuye
;
Ma, Dazhong
;
Wei, Qinglai
  |  
收藏
  |  
浏览/下载:47/0
  |  
提交时间:2020/06/22
Nash Q-learning
Integrated energy management game
Interconnected multicarrier systems
Equilibrium transfer
We-Energy
FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks
期刊论文
OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 6, 页码: 1367-1379
作者:
Zhang, Zhen
;
Zhao, Dongbin
;
Gao, Junwei
;
Wang, Dongqing
;
Dai, Yujie
  |  
收藏
  |  
浏览/下载:27/0
  |  
提交时间:2017/07/18
Multiagent Reinforcement Learning (Marl)
Nash Equilibrium
Q-learning
Repeated Game