中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
自动化研究所 [8]
沈阳自动化研究所 [1]
采集方式
OAI收割 [9]
内容类型
期刊论文 [5]
学位论文 [3]
会议论文 [1]
发表日期
2023 [1]
2021 [1]
2018 [1]
2016 [1]
2015 [1]
2014 [1]
更多
学科主题
筛选
浏览/检索结果:
共9条,第1-9条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Data-efficient model-based reinforcement learning with trajectory discrimination
期刊论文
OAI收割
COMPLEX & INTELLIGENT SYSTEMS, 2023, 页码: 10
作者:
Qu, Tuo
;
Duan, Fuqing
;
Zhang, Junge
;
Zhao, Bo
;
Huang, Wenzhen
  |  
收藏
  |  
浏览/下载:11/0
  |  
提交时间:2023/11/16
Reinforcement learning
Deep learning
Continuous control task
World model
Adaptive Critic Designs for Optimal Event-Driven Control of a CSTR System
期刊论文
OAI收割
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2021, 卷号: 17, 期号: 1, 页码: 484-493
作者:
Yang, Xiong
;
Wei, Qinglai
  |  
收藏
  |  
浏览/下载:47/0
  |  
提交时间:2021/01/06
Chemical reactors
Optimal control
Nonlinear systems
Adaptive systems
Cost function
Informatics
Closed loop systems
Adaptive critic designs (ACDs)
continuous stirred tank reactor (CSTR)
discounted cost
event-driven control
reinforcement learning (RL)
Learning Continuous Control through Proximal Policy Optimization for Mobile Robot Navigation
会议论文
OAI收割
Hangzhou, China, December 7-8, 2018
作者:
Zeng TP(曾太平)
  |  
收藏
  |  
浏览/下载:39/0
  |  
提交时间:2018/12/27
Mobile Robots
Deep Reinforcement Learning
Continuous Control
Proximal Policy Optimization
Robot Navigation
Mobile Robot Learning
Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics
期刊论文
OAI收割
IET CONTROL THEORY AND APPLICATIONS, 2016, 卷号: 10, 期号: 12, 页码: 1339-1347
作者:
Zhu, Yuanheng
;
Zhao, Dongbin
;
Li, Xiangjun
  |  
收藏
  |  
浏览/下载:25/0
  |  
提交时间:2016/12/26
Nonlinear Control Systems
Continuous Time Systems
Learning (Artificial Intelligence)
Optimal Control
Dynamic Programming
Lyapunov Methods
Linear Systems
Reinforcement Learning
Continuous-time Problem
Nonlinear Optimal Tracking Problem
Adaptive Dynamic Programming
Model-free Adaptive Optimal Tracking Algorithm
Lyapunov Analysis
Linear System
连续状态系统的近似最优在线强化学习
学位论文
OAI收割
工学博士, 中国科学院自动化研究所: 中国科学院大学, 2015
作者:
朱圆恒
收藏
  |  
浏览/下载:184/0
  |  
提交时间:2015/09/02
强化学习
最优控制
近似策略迭代
概率近似最优
连续状态系统
收敛性
在线学习
kd树
Reinforcement learning
optimal control
approximate policy iteration
probably approximately correct
continuous-state system
convergence
online learning
kd-tree
Dynamic dual adjustment of daily budgets and bids in sponsored search auctions
期刊论文
OAI收割
DECISION SUPPORT SYSTEMS, 2014, 卷号: 57, 页码: 105-114
作者:
Zhang, Jie
;
Yang, Yanwu
;
Li, Xin
;
Qin, Rui
;
Zeng, Daniel
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2015/08/12
Sponsored search auction
Budget adjustment
Continuous reinforcement learning
Dynamic adjustment
Neural-network-based online optimal control for uncertain non-linear continuous-time systems with control constraints
期刊论文
OAI收割
IET CONTROL THEORY AND APPLICATIONS, 2013, 卷号: 7, 期号: 17, 页码: 2037-2047
作者:
Yang, Xiong
;
Liu, Derong
;
Huang, Yuzhu
收藏
  |  
浏览/下载:32/0
  |  
提交时间:2015/08/12
adaptive control
approximation theory
closed loop systems
continuous time systems
Lyapunov methods
neurocontrollers
nonlinear control systems
optimal control
robust control
uncertain systems
neural network-based online adaptive optimal control
uncertain nonlinear continuous-time systems
control constraints
infinite-horizon optimal control problem
control policy
saturation constraints
identifier-critic architecture
Hamilton-Jacobi-Bellman equation approximation
uncertain system dynamics
critic NN
action-critic dual networks
reinforcement learning
identifier NN
policy iteration
LyapunovaEuros direct method
closed loop system stability
连续状态空间的强化学习问题
学位论文
OAI收割
工学硕士, 中国科学院自动化研究所: 中国科学院研究生院, 2007
何源
收藏
  |  
浏览/下载:238/0
  |  
提交时间:2015/09/02
强化学习
连续状态空间
核方法
函数逼近
reinforcement learning
continuous state space
kernel method
function
连续状态-动作空间下强化学习方法的研究
学位论文
OAI收割
工学博士, 中国科学院自动化研究所: 中国科学院研究生院, 2005
作者:
程玉虎
收藏
  |  
浏览/下载:255/0
  |  
提交时间:2015/09/02
强化学习
连续空间
函数逼近
RBF 网络
模糊推理系统
Reinforcement Learning
Continuous Space
Function Approximation
RBF Network
Fuzzy Inference System