中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
首页
机构
成果
学者
登录
注册
登陆
×
验证码:
换一张
忘记密码?
记住我
×
校外用户登录
CAS IR Grid
机构
数学与系统科学研究院 [3]
自动化研究所 [2]
计算技术研究所 [1]
采集方式
OAI收割 [6]
内容类型
期刊论文 [6]
发表日期
2024 [1]
2023 [1]
2020 [1]
2017 [1]
2004 [2]
学科主题
筛选
浏览/检索结果:
共6条,第1-6条
帮助
条数/页:
5
10
15
20
25
30
35
40
45
50
55
60
65
70
75
80
85
90
95
100
排序方式:
请选择
题名升序
题名降序
提交时间升序
提交时间降序
作者升序
作者降序
发表日期升序
发表日期降序
Consensus-Agent Deep Reinforcement Learning for Face Aging
期刊论文
OAI收割
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1795-1809
作者:
Lin, Ling
;
Liu, Hao
;
Liang, Jinqiao
;
Li, Zhendong
;
Feng, Jiao
  |  
收藏
  |  
浏览/下载:16/0
  |  
提交时间:2024/05/20
Face aging
deep reinforcement learning
Markov decision process
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning
期刊论文
OAI收割
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:
Wai-Chung Kwan
;
Hong-Ru Wang
;
Hui-Min Wang
;
Kam-Fai Wong
  |  
收藏
  |  
浏览/下载:2/0
  |  
提交时间:2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process
HDec-POSMDPs MRS Exploration and Fire Searching Based on IoT Cloud Robotics
期刊论文
OAI收割
International Journal of Automation and Computing, 2020, 卷号: 17, 期号: 3, 页码: 364-377
作者:
Ayman El Shenawy
;
Khalil Mohamed1, Hany Harb
  |  
收藏
  |  
浏览/下载:21/0
  |  
提交时间:2021/02/22
Multi-robot systems
hybrid decentralized partially observable semi-Markov decision process (HDec-POSMDPs)
multi-robot systems (MRS) exploration and fire searching
cloud robotics
cloud computing.
Maintenance optimization for a Markovian deteriorating system with population heterogeneity
期刊论文
OAI收割
IIE TRANSACTIONS, 2017, 卷号: 49, 期号: 1, 页码: 96-109
作者:
van Oosterom, Chiel
;
Peng, Hao
;
van Houtum, Geert-Jan
  |  
收藏
  |  
浏览/下载:26/0
  |  
提交时间:2018/07/30
Replacement optimization
population heterogeneity
partially observable Markov decision process
optimal policy structure
Empty container management in a port with long-run average criterion
期刊论文
OAI收割
MATHEMATICAL AND COMPUTER MODELLING, 2004, 卷号: 40, 期号: 1-2, 页码: 85-100
作者:
Li, JA
;
Liu, K
;
Leung, SCH
;
Lai, KK
  |  
收藏
  |  
浏览/下载:8/0
  |  
提交时间:2018/07/30
containerization problem
inventory
negative demand
Markov decision process
Potential-based online policy iteration algorithms for Markov decision processes
期刊论文
OAI收割
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, 卷号: 49, 期号: 4, 页码: 493-505
作者:
Fang, HT
;
Cao, XR
  |  
收藏
  |  
浏览/下载:13/0
  |  
提交时间:2018/07/30
Markov decision process
potential
recursive optimization