Chinese Academy of Sciences Institutional Repositories Grid

Browse/search results: 6 items in total, showing items 1-6

Consensus-Agent Deep Reinforcement Learning for Face Aging (journal article, OAI harvested)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1795-1809
Authors: Lin, Ling; Liu, Hao; Liang, Jinqiao; Li, Zhendong; Feng, Jiao
Views/Downloads: 16/0 | Submitted: 2024/05/20
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning (journal article, OAI harvested)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 318-334
Authors: Wai-Chung Kwan; Hong-Ru Wang; Hui-Min Wang; Kam-Fai Wong
Views/Downloads: 2/0 | Submitted: 2024/04/23
HDec-POSMDPs MRS Exploration and Fire Searching Based on IoT Cloud Robotics (journal article, OAI harvested)
International Journal of Automation and Computing, 2020, Volume: 17, Issue: 3, Pages: 364-377
Authors: Ayman El Shenawy; Khalil Mohamed; Hany Harb
Views/Downloads: 21/0 | Submitted: 2021/02/22
Maintenance optimization for a Markovian deteriorating system with population heterogeneity (journal article, OAI harvested)
IIE TRANSACTIONS, 2017, Volume: 49, Issue: 1, Pages: 96-109
Authors: van Oosterom, Chiel; Peng, Hao; van Houtum, Geert-Jan
Views/Downloads: 26/0 | Submitted: 2018/07/30
Empty container management in a port with long-run average criterion (journal article, OAI harvested)
MATHEMATICAL AND COMPUTER MODELLING, 2004, Volume: 40, Issue: 1-2, Pages: 85-100
Authors: Li, JA; Liu, K; Leung, SCH; Lai, KK
Views/Downloads: 8/0 | Submitted: 2018/07/30
Potential-based online policy iteration algorithms for Markov decision processes (journal article, OAI harvested)
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2004, Volume: 49, Issue: 4, Pages: 493-505
Authors: Fang, HT; Cao, XR
Views/Downloads: 13/0 | Submitted: 2018/07/30