Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge
文献类型:期刊论文
作者 | Lan Jiang; Hongyun Huang; Zuohua Ding |
刊名 | IEEE/CAA Journal of Automatica Sinica
![]() |
出版日期 | 2020 |
卷号 | 7期号:4页码:1179-1189 |
关键词 | Deep Q-learning (DQL) experience replay (ER) heuristic knowledge (HK) path planning |
ISSN号 | 2329-9266 |
DOI | 10.1109/JAS.2019.1911732 |
英文摘要 | Path planning and obstacle avoidance are two challenging problems in the study of intelligent robots. In this paper, we develop a new method to alleviate these problems based on deep Q-learning with experience replay and heuristic knowledge. In this method, a neural network has been used to resolve the “curse of dimensionality” issue of the Q-table in reinforcement learning. When a robot is walking in an unknown environment, it collects experience data which is used for training a neural network; such a process is called experience replay. Heuristic knowledge helps the robot avoid blind exploration and provides more effective data for training the neural network. The simulation results show that in comparison with the existing methods, our method can converge to an optimal action strategy with less time and can explore a path in an unknown environment with fewer steps and larger average reward. |
源URL | [http://ir.ia.ac.cn/handle/173211/43023] ![]() |
专题 | 自动化研究所_学术期刊_IEEE/CAA Journal of Automatica Sinica |
推荐引用方式 GB/T 7714 | Lan Jiang,Hongyun Huang,Zuohua Ding. Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge[J]. IEEE/CAA Journal of Automatica Sinica,2020,7(4):1179-1189. |
APA | Lan Jiang,Hongyun Huang,&Zuohua Ding.(2020).Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge.IEEE/CAA Journal of Automatica Sinica,7(4),1179-1189. |
MLA | Lan Jiang,et al."Path Planning for Intelligent Robots Based on Deep Q-learning With Experience Replay and Heuristic Knowledge".IEEE/CAA Journal of Automatica Sinica 7.4(2020):1179-1189. |
入库方式: OAI收割
来源:自动化研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。