中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
机构
采集方式
内容类型
发表日期
学科主题
筛选

浏览/检索结果: 共4条,第1-4条 帮助

条数/页: 排序方式:
CASOG: Conservative Actor–Critic With SmOoth Gradient for Skill Learning in Robot-Assisted Intervention 期刊论文  OAI收割
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 页码: 10
作者:  
Li, Hao;  Zhou, Xiao-Hu;  Xie, Xiao-Liang;  Liu, Shi-Qi;  Feng, Zhen-Qiu
  |  收藏  |  浏览/下载:25/0  |  提交时间:2024/02/22
Offline Pre-trained Multi-agent Decision Transformer 期刊论文  OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 233-248
作者:  
Linghui Meng;  Muning Wen;  Chenyang Le;  Xiyun Li;  Dengpeng Xing
  |  收藏  |  浏览/下载:6/0  |  提交时间:2024/04/23
Offline reinforcement learning with representations for actions 期刊论文  OAI收割
INFORMATION SCIENCES, 2022, 卷号: 610, 页码: 746-758
作者:  
Lou, Xingzhou;  Yin, Qiyue;  Zhang, Junge;  Yu, Chao;  He, Zhaofeng
  |  收藏  |  浏览/下载:45/0  |  提交时间:2022/11/14
POPO: Pessimistic Offline Policy Optimization 会议论文  OAI收割
Singapore, Singapore, 23-27 May 2022
作者:  
He Q(何强);  Hou XW(侯新文);  Liu Y(刘禹)
  |  收藏  |  浏览/下载:24/0  |  提交时间:2022/06/27