Chinese Academy of Sciences Institutional Repository Grid: Search

Browse/Search Results: 5 items in total, showing items 1-5


	Neural Dynamic Responses of Monetary and Social Reward Processes in Adolescents. Journal article (OAI harvested). FRONTIERS IN HUMAN NEUROSCIENCE, 2020, Volume: 14, Pages: 16. Authors: Wang, Di; Liu, Tongran; Shi, Jiannong. Views/Downloads: 41/0. Submitted: 2020/06/15. Keywords: reward processes neurodevelopment adolescence social reward monetary reward event-related potential
	Online reinforcement learning control by Bayesian inference. Journal article (OAI harvested). IET CONTROL THEORY AND APPLICATIONS, 2016, Volume: 10, Issue: 12, Pages: 1331-1338. Authors: Xia, Zhongpu; Zhao, Dongbin; Dongbin Zhao. Views/Downloads: 76/0. Submitted: 2016/06/15. Keywords: Learning Systems Bayes Methods Gaussian Processes Optimal Control Online Reinforcement Learning Control Bayesian Inference Self-learning Control Probability Action Value Function Gaussian Process Bayesian-state-action-reward-state-action Algorithm
	On average reward semi-Markov decision processes with a general multichain structure. Journal article (OAI harvested). MATHEMATICS OF OPERATIONS RESEARCH, 2004, Volume: 29, Issue: 2, Pages: 339-352. Authors: Jianyong, L; Xiaobo, Z. Views/Downloads: 52/0. Submitted: 2018/07/30. Keywords: semi-Markov decision processes average reward criterion multichain structure data-transformation method optimal policy
	Weighted Markov decision processes with perturbation. Journal article (OAI harvested). MATHEMATICAL METHODS OF OPERATIONS RESEARCH, 2001, Volume: 53, Issue: 3, Pages: 465-480. Authors: Liu, K; Filar, JA. Views/Downloads: 59/0. Submitted: 2018/07/30. Keywords: Markov decision processes weighted reward optimal policy delta-optimal singular perturbation general perturbation
	Nonhomogeneous Markov decision processes with Borel state space - The average criterion with nonuniformly bounded rewards. Journal article (OAI harvested). MATHEMATICS OF OPERATIONS RESEARCH, 2000, Volume: 25, Issue: 4, Pages: 667-678. Authors: Guo, XP; Liu, JY; Liu, K. Views/Downloads: 61/0. Submitted: 2018/07/30. Keywords: nonhomogeneous Markov decision processes average reward criterion optimality equations epsilon(>= 0)-optimal policies rolling horizon algorithm