中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [19]
沈阳自动化研究所 [2]
采集方式
OAI收割 [21]
内容类型
期刊论文 [19]
学位论文 [2]
发表日期
2024 [3]
2023 [2]
2021 [5]
2020 [3]
2019 [2]
2018 [1]
更多
学科主题
计算机科学技术::人... [1]
筛选
浏览/检索结果:
共21条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Boosting On-Policy Actor-Critic With Shallow Updates in Critic
期刊论文
OAI收割
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:
Li, Luntong
;
Zhu, Yuanheng
  |  
收藏
  |  
浏览/下载:31/0
  |  
提交时间:2024/07/03
Artificial neural networks
Vectors
Task analysis
Training
Representation learning
Approximation algorithms
Optimization
Actor-critic
deep reinforcement learning (DRL)
proximal policy optimization (PPO)
shallow reinforcement learning (SRL)
Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios
期刊论文
OAI收割
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 181-195
作者:
Xiaolin Tang
;
Yuyou Yang
;
Teng Liu
;
Xianke Lin
;
Kai Yang
  |  
收藏
  |  
浏览/下载:32/0
  |  
提交时间:2024/01/02
Automatic parking
control strategy
parking deviation (APS)
soft actor-critic (SAC)
Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms
期刊论文
OAI收割
NEURAL NETWORKS, 2024, 卷号: 169, 页码: 764-777
作者:
Chen, Yurou
;
Zhang, Fengyi
;
Liu, Zhiyong
  |  
收藏
  |  
浏览/下载:30/0
  |  
提交时间:2024/02/22
Reinforcement Learning
Policy gradient
Actor-critic
Value function
Bias-variance trade-off
Position and Attitude Tracking Control of a Biomimetic Underwater Vehicle via Deep Reinforcement Learning
期刊论文
OAI收割
IEEE/ASME Transactions on Mechatronics, 2023, 页码: 1-10
作者:
Ma, Ruichen
;
Wang, Yu
;
Tang, Chong
;
Wang, Shuo
;
Wang, Rui
  |  
收藏
  |  
浏览/下载:20/0
  |  
提交时间:2023/08/03
Biomimetic underwater vehicle (BUV)
Deep reinforcement learning (DRL)
Soft actor-critic (SAC)
Undulatory fin
Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot-RoboDact
期刊论文
OAI收割
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, 卷号: 72, 页码: 13
作者:
Zhang, Tiandong
;
Wang, Rui
  |  
收藏
  |  
浏览/下载:13/0
  |  
提交时间:2023/11/17
Active disturbance rejection control (ADRC)
bionic exploration robot
motion control
residual reinforcement learning (RRL)
soft actor-critic (SAC)
Generalized Actor-Critic Learning Optimal Control in Smart Home Energy Management
期刊论文
OAI收割
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 10, 页码: 6614-6623
作者:
Wei, Qinglai
;
Liao, Zehua
;
Shi, Guang
  |  
收藏
  |  
浏览/下载:26/0
  |  
提交时间:2021/11/02
Optimal control
Process control
Smart homes
Dynamic programming
Numerical models
Iterative methods
Informatics
Actor-critic learning
adaptive critic designs
adaptive dynamic programming (ADP)
approximate dynamic programming
energy management
optimal control
smart grid
Sparse online kernelized actor-critic Learning in reproducing kernel Hilbert space
期刊论文
OAI收割
ARTIFICIAL INTELLIGENCE REVIEW, 2021, 页码: 36
作者:
Yang, Yongliang
;
Zhu, Hufei
;
Zhang, Qichao
;
Zhao, Bo
;
Li, Zhenning
  |  
收藏
  |  
浏览/下载:29/0
  |  
提交时间:2021/11/02
Reproducing kernel Hilbert space
Actor-critic learning
Value function approximation
Online sparsification
Non-parametric learning
Siamese Regression Tracking With Reinforced Template Updating
期刊论文
OAI收割
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 卷号: 30, 页码: 628-640
作者:
Zhao, Fei
;
Zhang, Ting
;
Song, Yibing
;
Tang, Ming
;
Wang, Xiaobo
  |  
收藏
  |  
浏览/下载:31/0
  |  
提交时间:2021/03/02
Target tracking
Training
Reinforcement learning
Visualization
Task analysis
Benchmark testing
Head
Siamese regression tracking
actor-critic network
reinforcement learning
A Novel Heterogeneous Actor-critic Algorithm with Recent Emphasizing Replay Memory
期刊论文
OAI收割
International Journal of Automation and Computing, 2021, 卷号: 18, 期号: 4, 页码: 619-631
作者:
Bao Xi
;
Rui Wang
;
Shuo Wang
;
Ying-Hao Cai
  |  
收藏
  |  
浏览/下载:17/0
  |  
提交时间:2021/07/20
Reinforcement learning (RL)
actor-critic
experience replay
training efficiency
manipulation skill learning
Intelligent decision-making of scheduling for dynamic permutation flowshop via deep reinforcement learning
期刊论文
OAI收割
Sensors (Switzerland), 2021, 卷号: 21, 期号: 3, 页码: 1-20
作者:
Yang SL(杨圣落)
;
Xu ZG(徐志刚)
;
Wang JY(王军义)
  |  
收藏
  |  
浏览/下载:23/0
  |  
提交时间:2021/02/14
permutation flowshop scheduling problem
deep reinforcement learning
actor-critic
dynamic scheduling
real-time scheduling
new job arrival
tardiness cost