中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Institution: Institute of Automation [49]
Browse/search results: 49 items in total, showing items 1-10
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning
Journal article, OAI-harvested
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors: Chai, Jiajun; Zhu, Yuanheng; Zhao, Dongbin
Submitted: 2023/11/16
Keywords: Large-scale multiagent; neighboring communication; reinforcement learning (RL); variational information flow
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat
Journal article, OAI-harvested
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, DOI: 10.1109/TSMC.2023.3270444
Authors: Jiajun Chai; Wenzhang Chen; Yuanheng Zhu; Zong-xin Yao; Dongbin Zhao
Submitted: 2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games
Journal article, OAI-harvested
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors: Zhu, Yuanheng; Zhao, Dongbin
Submitted: 2022/06/10
Keywords: Games; Nash equilibrium; Mathematical model; Markov processes; Convergence; Dynamic programming; Training; Deep reinforcement learning (DRL); generalized policy iteration (GPI); Markov game (MG); Q network; zero sum
Empirical Policy Optimization for n-Player Markov Games
Journal article, OAI-harvested
IEEE Transactions on Cybernetics, 2022, DOI: 10.1109/TCYB.2022.3179775
Authors: Yuanheng Zhu; Weifan Li; Mengchen Zhao; Jianye Hao; Dongbin Zhao
Submitted: 2023/04/26
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning
Journal article, OAI-harvested
IEEE Transactions on Cognitive and Developmental Systems, 2022, DOI: 10.1109/TCDS.2022.3218940
Authors: Minsong Liu; Luntong Li; Shuai Hao; Yuanheng Zhu; Dongbin Zhao
Submitted: 2023/04/26
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target
Journal article, OAI-harvested
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors: Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin
Submitted: 2021/12/28
Keywords: Reinforcement learning; Missile guidance; Auxiliary learning; Self-imitation learning
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning
Journal article, OAI-harvested
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors: Hu, Guangzheng; Zhu, Yuanheng; Zhao, Dongbin; Zhao, Mengchen; Hao, Jianye
Submitted: 2022/01/27
Keywords: Bandwidth; Protocols; Reinforcement learning; Task analysis; Optimization; Communication networks; Multi-agent systems; Event trigger; limited bandwidth; multi-agent communication; multi-agent reinforcement learning (MARL)
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios
Journal article, OAI-harvested
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors: Chai, Jiajun; Li, Weifan; Zhu, Yuanheng; Zhao, Dongbin; Ma, Zhe
Submitted: 2022/01/27
Keywords: Multi-agent systems; Training; Task analysis; Reinforcement learning; Sun; Learning systems; Semantics; Centralized training with decentralized execution (CTDE); multiagent; reinforcement learning; StarCraft II
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors
Journal article, OAI-harvested
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, Volume: 18, Issue: 3, Pages: 1097-1108
Authors: Zhu, Yuanheng; Zhao, Dongbin; He, Haibo
Submitted: 2021/08/15
Keywords: Microscopy; Feedback control; Mathematical model; Data models; Dynamic programming; Psychology; Computational modeling; Adaptive dynamic programming (ADP); heterogeneous corridors; macroscopic pedestrian dynamics; optimal feedback control; pedestrian flow
Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs
Journal article, OAI-harvested
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 15
Authors: Yang, Xiong; Zhu, Yuanheng; Dong, Na; Wei, Qinglai
Submitted: 2022/01/27
Keywords: Adaptive critic designs (ACDs); adaptive dynamic programming (ADP); decentralized event-driven control; input constraint; reinforcement learning (RL)