中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [47]
沈阳自动化研究所 [4]
地理科学与资源研究所 [1]
计算技术研究所 [1]
科技战略咨询研究院 [1]
采集方式
OAI收割 [54]
内容类型
期刊论文 [45]
会议论文 [8]
学位论文 [1]
发表日期
2024 [6]
2023 [5]
2022 [5]
2021 [7]
2020 [4]
2019 [4]
更多
学科主题
筛选
浏览/检索结果:
共54条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games
期刊论文
OAI收割
IEEE TRANSACTIONS ON GAMES, 2024, 卷号: 16, 期号: 2, 页码: 470-482
作者:
Lou, Xingzhou
;
Zhang, Junge
;
Du, Yali
;
Yu, Chao
;
He, Zhaofeng
  |  
收藏
  |  
浏览/下载:10/0
  |  
提交时间:2024/09/09
Games
Predictive models
Reinforcement learning
Convergence
Task analysis
Correlation
Training
Joint-action embedding
multiagent
policy gradient
reinforcement learning
Boosting On-Policy Actor-Critic With Shallow Updates in Critic
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:
Li, Luntong
;
Zhu, Yuanheng
  |  
收藏
  |  
浏览/下载:31/0
  |  
提交时间:2024/07/03
Artificial neural networks
Vectors
Task analysis
Training
Representation learning
Approximation algorithms
Optimization
Actor-critic
deep reinforcement learning (DRL)
proximal policy optimization (PPO)
shallow reinforcement learning (SRL)
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 11
作者:
Zhu, Liao
;
Wei, Qinglai
;
Guo, Ping
  |  
收藏
  |  
浏览/下载:25/0
  |  
提交时间:2024/07/03
Approximate dynamic programming (ADP)
neural network
off-policy
optimal control
reinforcement learning (RL)
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms
期刊论文
OAI收割
NEURAL NETWORKS, 2024, 卷号: 169, 页码: 764-777
作者:
Chen, Yurou
;
Zhang, Fengyi
;
Liu, Zhiyong
  |  
收藏
  |  
浏览/下载:30/0
  |  
提交时间:2024/02/22
Reinforcement Learning
Policy gradient
Actor-critic
Value function
Bias-variance trade-off
Learning Sliding Policy of Flat Multi-target Objects in Clutter Scenes
期刊论文
OAI收割
INFORMATION TECHNOLOGY AND CONTROL, 2024, 卷号: 53, 期号: 1, 页码: 5-18
作者:
Wu, Liangdong
;
Wu, Jiaxi
;
Li, Zhengwei
;
Chen, Yurou
;
Liu, Zhiyong
  |  
收藏
  |  
浏览/下载:14/0
  |  
提交时间:2024/09/09
Deep Learning in Manipulation
Reinforcement Learning
Robot Control
Intelligent system
sliding policy
ToolBot: Learning Oriented Keypoints for Tool Usage From Self-Supervision
期刊论文
OAI收割
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 卷号: 20, 期号: 1, 页码: 723-731
作者:
Wei, Junhang
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2024/07/03
Data-efficient robot learning
self-superv ision
tool usage
visuomotor policy
Policy generation network for zero-shot policy learning
期刊论文
OAI收割
COMPUTATIONAL INTELLIGENCE, 2023, 页码: 27
作者:
Qian, Yiming
;
Zhang, Fengyi
;
Liu, Zhiyong
  |  
收藏
  |  
浏览/下载:7/0
  |  
提交时间:2023/11/17
knowledge representation
lifelong reinforcement learning
zero-shot policy generation
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning
会议论文
OAI收割
昆士兰, 2023-6
作者:
Li WF(李伟凡)
;
Zhu YH(朱圆恒)
;
Zhao DB(赵冬斌)
  |  
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2023/06/29
multi-agent
reinforcement learning
policy gradient
Dependency-Aware Vehicular Task Scheduling Policy for Tracking Service VEC Networks
期刊论文
OAI收割
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 3, 页码: 2400-2414
作者:
Li, Chao
;
Liu, Fagui
;
Wang, Bin
;
Chen, C. L. Philip
;
Tang, Xuhao
  |  
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2023/11/17
Task analysis
Intelligent vehicles
Optimization
Processor scheduling
Vehicle dynamics
Heuristic algorithms
Costs
Deep reinforcement learning (DRL)
scheduling policy
tracking service
vehicular edge computing (VEC)
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning
期刊论文
OAI收割
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:
Qiu JY(邱俊彦)
;
Haidong Zhang
;
Yiping Yang
  |  
收藏
  |  
浏览/下载:6/0
  |  
提交时间:2024/05/29
reinforcement learning
dialogue policy learning
curriculum learning
knowledge distillation