中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
数学与系统科学研究院 [8]
自动化研究所 [3]
软件研究所 [2]
地理科学与资源研究所 [1]
科技战略咨询研究院 [1]
沈阳自动化研究所 [1]
更多
采集方式
OAI收割 [16]
内容类型
期刊论文 [13]
EI期刊论文 [1]
会议论文 [1]
学位论文 [1]
发表日期
2021 [1]
2020 [1]
2016 [1]
2015 [2]
2005 [2]
2004 [1]
更多
学科主题
Applied [1]
Mathematic... [1]
筛选
浏览/检索结果:
共16条,第1-10条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Optimal Policies for Quantum Markov Decision Processes
期刊论文
OAI收割
International Journal of Automation and Computing, 2021, 卷号: 18, 期号: 3, 页码: 410-421
作者:
Ming-Sheng Ying
;
Yuan Feng
;
Sheng-Gang Ying
  |  
收藏
  |  
浏览/下载:72/0
  |  
提交时间:2021/05/24
Quantum Markov decision processes
quantum machine learning
reinforcement learning
dynamic programming
decision making
Approximate Dynamic Programming for Stochastic Resource Allocation Problems
期刊论文
OAI收割
IEEE/CAA Journal of Automatica Sinica, 2020, 卷号: 7, 期号: 4, 页码: 975-990
作者:
Ali Forootani
;
Raffaele Iervolino
;
Massimo Tipaldi
;
Joshua Neilson
  |  
收藏
  |  
浏览/下载:21/0
  |  
提交时间:2021/03/11
Approximate dynamic programming (ADP)
dynamic programming (DP)
Markov decision processes (MDPs)
resource allocation problem
Efficient approximation of optimal control for continuous-time Markov games
期刊论文
OAI收割
INFORMATION AND COMPUTATION, 2016, 卷号: 247, 页码: 106-129
Fearnley, J
;
Rabe, MN
;
Schewe, S
;
Zhang, LJ
  |  
收藏
  |  
浏览/下载:20/0
  |  
提交时间:2016/12/09
Continuous time Markov decision processes and games
Optimal control
Discretisation
Resilience-driven maintenance scheduling methodology for multi-agent production line system
会议论文
OAI收割
27th Chinese Control and Decision Conference, CCDC 2015, Qingdao, China, May 23-25, 2015
作者:
Wang X(王潇)
;
Qi C(祁超)
;
Wang HW(王洪伟)
;
Si QM(佀庆民)
;
Zhang GW(张国伟)
收藏
  |  
浏览/下载:19/0
  |  
提交时间:2015/11/18
resilience
deteriorating quality states
semi-Markov decision processes
resource constraints
multi-agent reinforcement learning
Transient Reward Approximation for Continuous-Time Markov Chains
期刊论文
OAI收割
IEEE TRANSACTIONS ON RELIABILITY, 2015, 卷号: 64, 期号: 4, 页码: 1254-1275
Hahn, EM
;
Hermanns, H
;
Wimmer, R
;
Becker, B
  |  
收藏
  |  
浏览/下载:28/0
  |  
提交时间:2016/12/13
Continuous-time Markov chains
continuous-time Markov decision processes
abstraction
symbolic methods
ordered binary decision diagrams
Weighted singularly perturbed hybrid stochastic systems
期刊论文
OAI收割
MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2005, 卷号: 62, 期号: 1, 页码: 41-54
作者:
Liu, K
;
Filar, JA
  |  
收藏
  |  
浏览/下载:14/0
  |  
提交时间:2018/07/30
weighted Markov Decision Processes
Hybrid Stochastic System
perturbations
optimal policy
delta-optimal
A markov chain-based probability vector approach for modeling spatial uncertainties of soil classes
EI期刊论文
OAI收割
2005
Li Weidong
;
Zhang Chuanrong
;
Burt James E.
;
Zhu A. Xing
收藏
  |  
浏览/下载:18/0
  |  
提交时间:2012/06/11
Approximation theory
Computer simulation
Decision making
Markov processes
Probability
Risk assessment
Soils
Vectors
On average reward semi-markov decision processes with a general multichain structure
期刊论文
OAI收割
MATHEMATICS OF OPERATIONS RESEARCH, 2004, 卷号: 29, 期号: 2, 页码: 339-352
作者:
Jianyong, L
;
Xiaobo, Z
  |  
收藏
  |  
浏览/下载:41/0
  |  
提交时间:2018/07/30
semi-Markov decision processes
average reward criterion
multichain structure
data-transformation method
optimal policy
Notes on average Markov decision processes with a minimum-variance criterion
期刊论文
OAI收割
OPERATIONS RESEARCH LETTERS, 2002, 卷号: 30, 期号: 2, 页码: 107-116
作者:
Liu, JY
  |  
收藏
  |  
浏览/下载:19/0
  |  
提交时间:2018/07/30
Markov decision processes
nonstationary MDP
average criterion
variance criterion
strong variance optimal policy
A note on optimality conditions for continuous-time Markov decision processes with average cost criterion
期刊论文
OAI收割
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2001, 卷号: 46, 期号: 12, 页码: 1984-1989
作者:
Guo, XP
;
Liu, K
  |  
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2018/07/30
average cost criterion
continuous-time Markov decision processes (MDPs)
optimal stationary policies
optimality inequality