中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [54]
采集方式
OAI收割 [54]
内容类型
期刊论文 [35]
会议论文 [18]
专著章节/文集论文 [1]
发表日期
2024 [5]
2023 [2]
2022 [2]
2021 [5]
2020 [3]
2019 [4]
更多
学科主题
筛选
浏览/检索结果:
共54条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning
会议论文
OAI收割
河南开封, 2024年5月17-19日
作者:
Huang, Shangjing
;
Zhao, Zijie
;
Zhu, Yuanheng
;
Zhao, Dongbin
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2024/06/26
Boosting On-Policy Actor-Critic With Shallow Updates in Critic
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:
Li, Luntong
;
Zhu, Yuanheng
  |  
收藏
  |  
浏览/下载:31/0
  |  
提交时间:2024/07/03
Artificial neural networks
Vectors
Task analysis
Training
Representation learning
Approximation algorithms
Optimization
Actor-critic
deep reinforcement learning (DRL)
proximal policy optimization (PPO)
shallow reinforcement learning (SRL)
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game
期刊论文
OAI收割
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:
Guangzheng Hu
;
Yuanheng Zhu
;
Haoran Li
;
Dongbin Zhao
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning
期刊论文
OAI收割
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:
Boyu Li
;
Haran Li
;
Yuanheng Zhu
;
Dongbin Zhao
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2024/06/05
Boosting On-Policy Actor–Critic With Shallow Updates in Critic
期刊论文
OAI收割
IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10
作者:
Luntong Li
;
Yuanheng Zhu
  |  
收藏
  |  
浏览/下载:25/0
  |  
提交时间:2024/06/05
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:
Chai, Jiajun
;
Zhu, Yuanheng
;
Zhao, Dongbin
  |  
收藏
  |  
浏览/下载:12/0
  |  
提交时间:2023/11/16
Large-scale multiagent
neighboring communication
reinforcement learning (RL)
variational information flow
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat
期刊论文
OAI收割
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, 页码: DOI: 10.1109/TSMC.2023.3270444
作者:
Jiajun Chai
;
Wenzhang Chen
;
Yuanheng Zhu
;
Zong-xin Yao,
;
Dongbin Zhao
  |  
收藏
  |  
浏览/下载:15/0
  |  
提交时间:2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
  |  
收藏
  |  
浏览/下载:33/0
  |  
提交时间:2022/06/10
Games
Nash equilibrium
Mathematical model
Markov processes
Convergence
Dynamic programming
Training
Deep reinforcement learning (DRL)
generalized policy iteration (GPI)
Markov game (MG)
Nash equilibrium
Q network
zero sum
Empirical Policy Optimization for n-Player Markov Games
期刊论文
OAI收割
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:
Yuanheng Zhu
;
Weifan Li
;
Mengchen Zhao
;
Jianye Hao
;
Dongbin Zhao
  |  
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2023/04/26
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target
期刊论文
OAI收割
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:
Li, Weifan
;
Zhu, Yuanheng
;
Zhao, Dongbin
  |  
收藏
  |  
浏览/下载:36/0
  |  
提交时间:2021/12/28
Reinforcement learning
Missile guidance
Auxiliary learning
Self-imitation learning