Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing
文献类型:期刊论文
作者 | Yang SL(杨圣落)1,2,3![]() ![]() |
刊名 | International Journal of Production Research
![]() |
出版日期 | 2021 |
页码 | 1-18 |
关键词 | Deep reinforcement learning dynamic scheduling and reconfiguration A2C reconfigurable manufacturing system (RMS) intelligent scheduling dynamic job arrival |
ISSN号 | 0020-7543 |
产权排序 | 1 |
英文摘要 | To realise the intelligent decision-making of dynamic scheduling and reconfiguration, we studied the intelligent scheduling and reconfiguration with dynamic job arrival for a reconfigurable flow line (RFL) using deep reinforcement learning (DRL), for the first time. The system architecture of intelligent scheduling and reconfiguration in smart manufacturing is proposed, and the mathematical model is established to minimise total tardiness cost. In addition, a DRL system of scheduling and reconfiguration is proposed by designing state features, actions, and rewards for scheduling and reconfiguration agents. Moreover, the advantage actor-critic (A2C) is adapted to solve the studied problem. The training curve shows the A2C-based agents have effectively learned to generate better solutions for unseen instances. The test results show that the A2C-based approach outperforms two traditional meta-heuristics, iterated greedy (IG) and genetic algorithm (GA), in solution quality and CPU times by a large margin. Specifically, the A2C-based approach outperforms IG and GA by 57.43% and 88.30%, using only 0.46 and 2.20 CPU times of IG and GA. The trained model can generate a scheduling or reconfiguration decision within 1.47 ms, which is almost instantaneous and can satisfy real-time optimisation. Our work shows a promising prospect of using DRL for intelligent scheduling and reconfiguration. |
WOS关键词 | ITERATED GREEDY ALGORITHM ; PERMUTATION FLOW-SHOP ; TOTAL TARDINESS ; OPTIMIZATION ; MINIMIZATION ; HEURISTICS ; EARLINESS |
资助项目 | National Natural Science Foundation of China[61803367] ; Natural Science Foundation of Liaoning Province[2019-MS-346] |
WOS研究方向 | Engineering ; Operations Research & Management Science |
语种 | 英语 |
WOS记录号 | WOS:000677750100001 |
资助机构 | National Natural Science Foundation of China (61803367) ; Natural Science Foundation of Liaoning Province (2019-MS-346) |
源URL | [http://ir.sia.cn/handle/173321/29358] ![]() |
专题 | 沈阳自动化研究所_装备制造技术研究室 |
通讯作者 | Xu ZG(徐志刚) |
作者单位 | 1.(Chinese Academy of Sciences, Institutes for Robotics and Intelligent Manufacturing, Shenyang, China 2.University of Chinese Academy of Sciences, Beijing, China 3.Chinese Academy of Sciences, Shenyang Institute of Automation, Shenyang, China |
推荐引用方式 GB/T 7714 | Yang SL,Xu ZG. Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing[J]. International Journal of Production Research,2021:1-18. |
APA | Yang SL,&Xu ZG.(2021).Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing.International Journal of Production Research,1-18. |
MLA | Yang SL,et al."Intelligent scheduling and reconfiguration via deep reinforcement learning in smart manufacturing".International Journal of Production Research (2021):1-18. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。