中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge

文献类型:期刊论文

作者Lan Jiang; Hongyun Huang; Zuohua Ding
刊名IEEE/CAA Journal of Automatica Sinica
出版日期2020
卷号7期号:4页码:1179-1189
关键词Deep Q-learning (DQL) experience replay (ER) heuristic knowledge (HK) path planning
ISSN号2329-9266
DOI10.1109/JAS.2019.1911732
英文摘要Path planning and obstacle avoidance are two challenging problems in the study of intelligent robots. In this paper, we develop a new method to alleviate these problems based on deep Q-learning with experience replay and heuristic knowledge. In this method, a neural network has been used to resolve the “curse of dimensionality” issue of the Q-table in reinforcement learning. When a robot is walking in an unknown environment, it collects experience data which is used for training a neural network; such a process is called experience replay. Heuristic knowledge helps the robot avoid blind exploration and provides more effective data for training the neural network. The simulation results show that in comparison with the existing methods, our method can converge to an optimal action strategy with less time and can explore a path in an unknown environment with fewer steps and larger average reward.
源URL[http://ir.ia.ac.cn/handle/173211/43023]  
专题自动化研究所_学术期刊_IEEE/CAA Journal of Automatica Sinica
推荐引用方式
GB/T 7714
Lan Jiang,Hongyun Huang,Zuohua Ding. Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge[J]. IEEE/CAA Journal of Automatica Sinica,2020,7(4):1179-1189.
APA Lan Jiang,Hongyun Huang,&Zuohua Ding.(2020).Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge.IEEE/CAA Journal of Automatica Sinica,7(4),1179-1189.
MLA Lan Jiang,et al."Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge".IEEE/CAA Journal of Automatica Sinica 7.4(2020):1179-1189.

入库方式: OAI收割

来源:自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。