中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
机构
采集方式
内容类型
发表日期
学科主题
筛选

浏览/检索结果: 共54条,第1-10条 帮助

条数/页: 排序方式:
Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games 期刊论文  OAI收割
IEEE TRANSACTIONS ON GAMES, 2024, 卷号: 16, 期号: 2, 页码: 470-482
作者:  
Lou, Xingzhou;  Zhang, Junge;  Du, Yali;  Yu, Chao;  He, Zhaofeng
  |  收藏  |  浏览/下载:10/0  |  提交时间:2024/09/09
Boosting On-Policy Actor-Critic With Shallow Updates in Critic 期刊论文  OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:  
Li, Luntong;  Zhu, Yuanheng
  |  收藏  |  浏览/下载:31/0  |  提交时间:2024/07/03
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees 期刊论文  OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 11
作者:  
Zhu, Liao;  Wei, Qinglai;  Guo, Ping
  |  收藏  |  浏览/下载:25/0  |  提交时间:2024/07/03
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms 期刊论文  OAI收割
NEURAL NETWORKS, 2024, 卷号: 169, 页码: 764-777
作者:  
Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
  |  收藏  |  浏览/下载:30/0  |  提交时间:2024/02/22
Learning Sliding Policy of Flat Multi-target Objects in Clutter Scenes 期刊论文  OAI收割
INFORMATION TECHNOLOGY AND CONTROL, 2024, 卷号: 53, 期号: 1, 页码: 5-18
作者:  
Wu, Liangdong;  Wu, Jiaxi;  Li, Zhengwei;  Chen, Yurou;  Liu, Zhiyong
  |  收藏  |  浏览/下载:14/0  |  提交时间:2024/09/09
ToolBot: Learning Oriented Keypoints for Tool Usage From Self-Supervision 期刊论文  OAI收割
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2024, 卷号: 20, 期号: 1, 页码: 723-731
作者:  
Wei, Junhang
  |  收藏  |  浏览/下载:18/0  |  提交时间:2024/07/03
Policy generation network for zero-shot policy learning 期刊论文  OAI收割
COMPUTATIONAL INTELLIGENCE, 2023, 页码: 27
作者:  
Qian, Yiming;  Zhang, Fengyi;  Liu, Zhiyong
  |  收藏  |  浏览/下载:7/0  |  提交时间:2023/11/17
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文  OAI收割
昆士兰, 2023-6
作者:  
Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
  |  收藏  |  浏览/下载:16/0  |  提交时间:2023/06/29
Dependency-Aware Vehicular Task Scheduling Policy for Tracking Service VEC Networks 期刊论文  OAI收割
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, 卷号: 8, 期号: 3, 页码: 2400-2414
作者:  
Li, Chao;  Liu, Fagui;  Wang, Bin;  Chen, C. L. Philip;  Tang, Xuhao
  |  收藏  |  浏览/下载:16/0  |  提交时间:2023/11/17
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文  OAI收割
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  
Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
  |  收藏  |  浏览/下载:6/0  |  提交时间:2024/05/29