Chinese Academy of Sciences Institutional Repositories Grid (中国科学院机构知识库网格)

Browse/search results: 11 items in total, showing items 1-10

Leveraging Joint-Action Embedding in Multiagent Reinforcement Learning for Cooperative Games [Journal article, OAI harvested]
IEEE TRANSACTIONS ON GAMES, 2024, volume: 16, issue: 2, pages: 470-482
Authors: Lou, Xingzhou; Zhang, Junge; Du, Yali; Yu, Chao; He, Zhaofeng
Views/downloads: 12/0 | Submitted: 2024/09/09
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms [Journal article, OAI harvested]
NEURAL NETWORKS, 2024, volume: 169, pages: 764-777
Authors: Chen, Yurou; Zhang, Fengyi; Liu, Zhiyong
Views/downloads: 33/0 | Submitted: 2024/02/22
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning [Conference paper, OAI harvested]
Queensland, 2023-6
Authors: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌)
Views/downloads: 16/0 | Submitted: 2023/06/29
Energy-Efficient Design for a NOMA Assisted STAR-RIS Network With Deep Reinforcement Learning [Journal article, OAI harvested]
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, volume: 72, issue: 4, pages: 5424-5428
Authors: Guo, Yi; Fang, Fang; Cai, Donghong; Ding, Zhiguo
Views/downloads: 10/0 | Submitted: 2023/07/13
A dynamic ensemble deep deterministic policy gradient recursive network for spatiotemporal traffic speed forecasting in an urban road network [Journal article, OAI harvested]
DIGITAL SIGNAL PROCESSING, 2022, volume: 129, pages: 16
Authors: Mi, Xiwei; Yu, Chengqing; Liu, Xinwei; Yan, Guangxi; Yu, Fuhao
Views/downloads: 16/0 | Submitted: 2023/07/12
Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization [Journal article, OAI harvested]
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, pages: 13
Authors: Ning, Lingbin; Zhou, Min; Hou, Zhuopu; Goverde, Rob M. P.; Wang, Fei-Yue
Views/downloads: 27/0 | Submitted: 2022/01/27
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning [Conference paper, OAI harvested]
Suzhou, China, May 14-16, 2021
Authors: Ma, Ruichen; Wang, Yu; Wang, Rui; Wang, Shuo
Views/downloads: 9/0 | Submitted: 2023/08/02
RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles [Journal article, OAI harvested]
REMOTE SENSING, 2020, volume: 12, issue: 11, pages: 25
Authors: Gao, Xile; Luo, Haiyong; Ning, Bokun; Zhao, Fang; Bao, Linfeng
Views/downloads: 37/0 | Submitted: 2020/12/10
Image captioning via hierarchical attention mechanism and policy gradient optimization [Journal article, OAI harvested]
SIGNAL PROCESSING, 2020, volume: 167, pages: 12
Authors: Yan, Shiyang; Xie, Yuan; Wu, Fangyu; Smith, Jeremy S.; Lu, Wenjin
Views/downloads: 42/0 | Submitted: 2020/03/30
Conservative Policy Gradient in Multi-critic Setting [Conference paper, OAI harvested]
Hangzhou, China, 2019.11.22-24
Authors: Xi, Bao; Wang, Rui; Wang, Shuo; Lu, Tao; Cai, Yinghao
Views/downloads: 22/0 | Submitted: 2021/02/02