Chinese Academy of Sciences Institutional Repositories Grid (中国科学院机构知识库网格)

Browse/search results: 201 records in total, showing 1-10

Exploration via Joint Policy Diversity for Sparse-Reward Multi-Agent Tasks  [Conference paper, OAI harvested]
Macao, China, 2023-8
Authors: Pei Xu;  Junge Zhang;  Kaiqi Huang
Views/Downloads: 21/0  |  Submitted: 2023/06/19
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning  [Conference paper, OAI harvested]
Queensland, 2023-6
Authors: Li WF(李伟凡);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Views/Downloads: 10/0  |  Submitted: 2023/06/29
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction  [Conference paper, OAI harvested]
Washington, USA, 2023.02.07 - 2023.02.14
Authors: Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜)
Views/Downloads: 15/0  |  Submitted: 2023/07/05
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning  [Conference paper, OAI harvested]
Kigali City, Rwanda, Africa, 2023-5-5
Authors: Junjie, Wang;  Yao, Mu;  Dong, Li;  Qichao, Zhang;  Dongbin, Zhao
Views/Downloads: 11/0  |  Submitted: 2023/06/29
Curiosity-Driven and Victim-Aware Adversarial Policies  [Conference paper, OAI harvested]
Austin TX, USA, December 5-9, 2022
Authors: Gong C(龚晨);  Yang Z(杨洲);  Bai YP(白云鹏);  Shi JK(史杰克);  Sinha Arunesh
Views/Downloads: 10/0  |  Submitted: 2023/06/27
Pseudo Value Network Distillation for High-Performance Exploration  [Conference paper, OAI harvested]
Australia, 2023-06
Authors: Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣)
Views/Downloads: 8/0  |  Submitted: 2023/06/28
Knowledge Transfer from Situation Evaluation to Multi-agent Reinforcement Learning  [Conference paper, OAI harvested]
New Delhi, India, November 22-26, 2022
Authors: Chen M(陈敏);  Pu ZQ(蒲志强);  Pan Y(潘一);  Yi JQ(易建强)
Views/Downloads: 7/0  |  Submitted: 2023/06/27
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks  [Conference paper, OAI harvested]
Washington DC, USA, 2023-2-7
Authors: Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang
Views/Downloads: 10/0  |  Submitted: 2023/06/19
Learning Cooperative Policies with Graph Networks in Distributed Swarm Systems  [Conference paper, OAI harvested]
Queensland, Australia, June 18-23, 2023
Authors: Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强);  Ai XL(艾晓琳)
Views/Downloads: 14/0  |  Submitted: 2023/06/12
Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information  [Conference paper, OAI harvested]
Jinghong, China, 05-09 December 2022
Authors: Junhang Wei;  Shaowei Cui;  Peng Hao;  Shuo Wang
Views/Downloads: 5/0  |  Submitted: 2023/10/25