中国科学院机构知识库网格系统: Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

文献类型：期刊论文


作者	Yang SL(杨圣落)1,2,3 ; Xu ZG(徐志刚)1,3
刊名	International Journal of Production Research
出版日期	2021
页码	1-18
关键词	Deep reinforcement learning dynamic scheduling and reconfiguration A2C reconfigurable manufacturing system (RMS) intelligent scheduling dynamic job arrival
ISSN号	0020-7543
产权排序	1
英文摘要	To realise the intelligent decision-making of dynamic scheduling and reconfiguration, we studied the intelligent scheduling and reconfiguration with dynamic job arrival for a reconfigurable flow line (RFL) using deep reinforcement learning (DRL), for the first time. The system architecture of intelligent scheduling and reconfiguration in smart manufacturing is proposed, and the mathematical model is established to minimise total tardiness cost. In addition, a DRL system of scheduling and reconfiguration is proposed by designing state features, actions, and rewards for scheduling and reconfiguration agents. Moreover, the advantage actor-critic (A2C) is adapted to solve the studied problem. The training curve shows the A2C-based agents have effectively learned to generate better solutions for unseen instances. The test results show that the A2C-based approach outperforms two traditional meta-heuristics, iterated greedy (IG) and genetic algorithm (GA), in solution quality and CPU times by a large margin. Specifically, the A2C-based approach outperforms IG and GA by 57.43% and 88.30%, using only 0.46 and 2.20 CPU times of IG and GA. The trained model can generate a scheduling or reconfiguration decision within 1.47 ms, which is almost instantaneous and can satisfy real-time optimisation. Our work shows a promising prospect of using DRL for intelligent scheduling and reconfiguration.
WOS关键词	ITERATED GREEDY ALGORITHM ; PERMUTATION FLOW-SHOP ; TOTAL TARDINESS ; OPTIMIZATION ; MINIMIZATION ; HEURISTICS ; EARLINESS
资助项目	National Natural Science Foundation of China[61803367] ; Natural Science Foundation of Liaoning Province[2019-MS-346]
WOS研究方向	Engineering ; Operations Research & Management Science
语种	英语
WOS记录号	WOS:000677750100001
资助机构	National Natural Science Foundation of China (61803367) ; Natural Science Foundation of Liaoning Province (2019-MS-346)
源URL	[http://ir.sia.cn/handle/173321/29358]
专题	沈阳自动化研究所_装备制造技术研究室
通讯作者	Xu ZG(徐志刚)
作者单位	1.(Chinese Academy of Sciences, Institutes for Robotics and Intelligent Manufacturing, Shenyang, China 2.University of Chinese Academy of Sciences, Beijing, China 3.Chinese Academy of Sciences, Shenyang Institute of Automation, Shenyang, China
推荐引用方式 GB/T 7714	Yang SL,Xu ZG. Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing[J]. International Journal of Production Research,2021:1-18.
APA	Yang SL,&Xu ZG.(2021).Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing.International Journal of Production Research,1-18.
MLA	Yang SL,et al."Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing".International Journal of Production Research (2021):1-18.

入库方式： OAI收割

来源：沈阳自动化研究所

下载0

Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing

其他版本