中国科学院机构知识库网格系统: Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge

中国科学院机构知识库网格

Chinese Academy of Sciences Institutional Repositories Grid

Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge

文献类型：期刊论文


作者	Lan Jiang; Hongyun Huang; Zuohua Ding
刊名	IEEE/CAA Journal of Automatica Sinica
出版日期	2020
卷号	7 期号:4 页码:1179-1189
关键词	Deep Q-learning (DQL) experience replay (ER) heuristic knowledge (HK) path planning
ISSN号	2329-9266
DOI	10.1109/JAS.2019.1911732
英文摘要	Path planning and obstacle avoidance are two challenging problems in the study of intelligent robots. In this paper, we develop a new method to alleviate these problems based on deep Q-learning with experience replay and heuristic knowledge. In this method, a neural network has been used to resolve the “curse of dimensionality” issue of the Q-table in reinforcement learning. When a robot is walking in an unknown environment, it collects experience data which is used for training a neural network; such a process is called experience replay. Heuristic knowledge helps the robot avoid blind exploration and provides more effective data for training the neural network. The simulation results show that in comparison with the existing methods, our method can converge to an optimal action strategy with less time and can explore a path in an unknown environment with fewer steps and larger average reward.
源URL	[http://ir.ia.ac.cn/handle/173211/43023]
专题	自动化研究所_学术期刊_IEEE/CAA Journal of Automatica Sinica
推荐引用方式 GB/T 7714	Lan Jiang,Hongyun Huang,Zuohua Ding. Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge[J]. IEEE/CAA Journal of Automatica Sinica,2020,7(4):1179-1189.
APA	Lan Jiang,Hongyun Huang,&Zuohua Ding.(2020).Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge.IEEE/CAA Journal of Automatica Sinica,7(4),1179-1189.
MLA	Lan Jiang,et al."Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge".IEEE/CAA Journal of Automatica Sinica 7.4(2020):1179-1189.

入库方式： OAI收割

来源：自动化研究所

浏览0

下载0

收藏0

其他版本

除非特别说明，本系统中所有内容都受版权保护，并保留所有权利。