中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [7]
计算技术研究所 [2]
西安光学精密机械研究... [1]
采集方式
OAI收割 [10]
内容类型
期刊论文 [7]
会议论文 [3]
发表日期
2024 [1]
2023 [2]
2022 [1]
2021 [2]
2020 [2]
2019 [1]
更多
学科主题
筛选
浏览/检索结果:
共10条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
提交时间升序
提交时间降序
发表日期升序
发表日期降序
题名升序
题名降序
作者升序
作者降序
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms
期刊论文
OAI收割
NEURAL NETWORKS, 2024, 卷号: 169, 页码: 764-777
作者:
Chen, Yurou
;
Zhang, Fengyi
;
Liu, Zhiyong
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2024/02/22
Reinforcement Learning
Policy gradient
Actor-critic
Value function
Bias-variance trade-off
Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning
会议论文
OAI收割
昆士兰, 2023-6
作者:
Li WF(李伟凡)
;
Zhu YH(朱圆恒)
;
Zhao DB(赵冬斌)
  |  
收藏
  |  
浏览/下载:10/0
  |  
提交时间:2023/06/29
multi-agent
reinforcement learning
policy gradient
Energy-Efficient Design for a NOMA Assisted STAR-RIS Network With Deep Reinforcement Learning
期刊论文
OAI收割
IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY, 2023, 卷号: 72, 期号: 4, 页码: 5424-5428
作者:
Guo, Yi
;
Fang, Fang
;
Cai, Donghong
;
Ding, Zhiguo
  |  
收藏
  |  
浏览/下载:6/0
  |  
提交时间:2023/07/13
Energy efficiency
deep deterministic policy gradient (DDPG)
simultaneous transmitting and reflecting reconfigurable intelligent surfaces (STAR-RISs)
non-orthogonal multiple access (NOMA)
multiple-input and single-output (MISO)
A dynamic ensemble deep deterministic policy gradient recursive network for spatiotemporal traffic speed forecasting in an urban road network
期刊论文
OAI收割
DIGITAL SIGNAL PROCESSING, 2022, 卷号: 129, 页码: 16
作者:
Mi, Xiwei
;
Yu, Chengqing
;
Liu, Xinwei
;
Yan, Guangxi
;
Yu, Fuhao
  |  
收藏
  |  
浏览/下载:4/0
  |  
提交时间:2023/07/12
Spatiotemporal traffic speed forecasting
Deep deterministic policy gradient
Simple recursive network
Temporal convolution network
Deep Deterministic Policy Gradient for High-Speed Train Trajectory Optimization
期刊论文
OAI收割
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 页码: 13
作者:
Ning, Lingbin
;
Zhou, Min
;
Hou, Zhuopu
;
Goverde, Rob M. P.
;
Wang, Fei-Yue
  |  
收藏
  |  
浏览/下载:15/0
  |  
提交时间:2022/01/27
Rail transportation
Training
Heuristic algorithms
Resistance
Optimal control
Trajectory optimization
Switches
High-speed railway
train trajectory optimization
deep deterministic policy gradient
energy efficiency
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning
会议论文
OAI收割
Suzhou, China, May 14-16, 2021
作者:
Ma, Ruichen
;
Wang, Yu
;
Wang, Rui
;
Wang, Shuo
  |  
收藏
  |  
浏览/下载:5/0
  |  
提交时间:2023/08/02
Omnidirectional Drift Control
Undulating Fin
Underwater Biomimetic Vehicle-manipulator System (UBVMS)
Reinforcement Learning
Twin Delayed Deep Deterministic policy gradient (TD3)
RL-AKF: An Adaptive Kalman Filter Navigation Algorithm Based on Reinforcement Learning for Ground Vehicles
期刊论文
OAI收割
REMOTE SENSING, 2020, 卷号: 12, 期号: 11, 页码: 25
作者:
Gao, Xile
;
Luo, Haiyong
;
Ning, Bokun
;
Zhao, Fang
;
Bao, Linfeng
  |  
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2020/12/10
integrated navigation
Kalman filter
process noise covariance estimation
reinforcement learning
deep deterministic policy gradient
Image captioning via hierarchical attention mechanism and policy gradient optimization
期刊论文
OAI收割
SIGNAL PROCESSING, 2020, 卷号: 167, 页码: 12
作者:
Yan, Shiyang
;
Xie, Yuan
;
Wu, Fangyu
;
Smith, Jeremy S.
;
Lu, Wenjin
  |  
收藏
  |  
浏览/下载:30/0
  |  
提交时间:2020/03/30
Image captioning
Hierarchical attention mechanism
Generative adversarial network
Reinforcement learning
Policy gradient
Conservative Policy Gradient in Multi-critic Setting
会议论文
OAI收割
Hangzhou, China, 2019.11.22-24
作者:
Xi, Bao
;
Wang, Rui
;
Wang, Shuo
;
Lu, Tao
;
Cai, Yinghao
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2021/02/02
inconsistancy
stablility
Q learning
policy gradient
Policy Gradient Adaptive Dynamic Programming for Data-Based Optimal Control
期刊论文
OAI收割
IEEE TRANSACTIONS ON CYBERNETICS, 2017, 卷号: 47, 期号: 10, 页码: 3341-3354
作者:
Luo, Biao
;
Liu, Derong
;
Wu, Huai-Ning
;
Wang, Ding
;
Lewis, Frank L.
  |  
收藏
  |  
浏览/下载:20/0
  |  
提交时间:2016/11/09
Adaptive Control
Adaptive Dynamic Programming (Adp)
Data-based
Off-policy Learning
Optimal Control
Policy Gradient