Chinese Academy of Sciences Institutional Repositories Grid (中国科学院机构知识库网格)

Browse/search results: 10 items in total, showing items 1-10

Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms | Journal article | OAI harvested
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors: Chen, Yurou; Zhang, Fengyi; Liu, Zhiyong
Views/Downloads: 11/0 | Submitted: 2024/02/22
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning | Conference paper | OAI harvested
Queensland, 2023-6
Authors: Li WF (李伟凡); Zhu YH (朱圆恒); Zhao DB (赵冬斌)
Views/Downloads: 10/0 | Submitted: 2023/06/29
Energy-Efficient Design for a NOMA Assisted STAR-RIS Network With Deep Reinforcement Learning | Journal article | OAI harvested
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, Volume: 72, Issue: 4, Pages: 5424-5428
Authors: Guo, Yi; Fang, Fang; Cai, Donghong; Ding, Zhiguo
Views/Downloads: 6/0 | Submitted: 2023/07/13
A dynamic ensemble deep deterministic policy gradient recursive network for spatiotemporal traffic speed forecasting in an urban road network | Journal article | OAI harvested
DIGITAL SIGNAL PROCESSING, 2022, Volume: 129, Pages: 16
Authors: Mi, Xiwei; Yu, Chengqing; Liu, Xinwei; Yan, Guangxi; Yu, Fuhao
Views/Downloads: 4/0 | Submitted: 2023/07/12
Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization | Journal article | OAI harvested
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, Pages: 13
Authors: Ning, Lingbin; Zhou, Min; Hou, Zhuopu; Goverde, Rob M. P.; Wang, Fei-Yue
Views/Downloads: 15/0 | Submitted: 2022/01/27
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning | Conference paper | OAI harvested
Suzhou, China, May 14-16, 2021
Authors: Ma, Ruichen; Wang, Yu; Wang, Rui; Wang, Shuo
Views/Downloads: 5/0 | Submitted: 2023/08/02
RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles | Journal article | OAI harvested
REMOTE SENSING, 2020, Volume: 12, Issue: 11, Pages: 25
Authors: Gao, Xile; Luo, Haiyong; Ning, Bokun; Zhao, Fang; Bao, Linfeng
Views/Downloads: 18/0 | Submitted: 2020/12/10
Image captioning via hierarchical attention mechanism and policy gradient optimization | Journal article | OAI harvested
SIGNAL PROCESSING, 2020, Volume: 167, Pages: 12
Authors: Yan, Shiyang; Xie, Yuan; Wu, Fangyu; Smith, Jeremy S.; Lu, Wenjin
Views/Downloads: 30/0 | Submitted: 2020/03/30
Conservative Policy Gradient in Multi-critic Setting | Conference paper | OAI harvested
Hangzhou, China, 2019.11.22-24
Authors: Xi, Bao; Wang, Rui; Wang, Shuo; Lu, Tao; Cai, Yinghao
Views/Downloads: 11/0 | Submitted: 2021/02/02
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control | Journal article | OAI harvested
IEEE TRANSACTIONS ON CYBERNETICS, 2017, Volume: 47, Issue: 10, Pages: 3341-3354
Authors: Luo, Biao; Liu, Derong; Wu, Huai-Ning; Wang, Ding; Lewis, Frank L.
Views/Downloads: 20/0 | Submitted: 2016/11/09