Intelligent decision-making of scheduling for dynamic permutation flowshop via deep reinforcement learning
文献类型:期刊论文
作者 | Yang SL(杨圣落)1,2,3; Xu ZG(徐志刚)2,3![]() ![]() |
刊名 | Sensors (Switzerland)
![]() |
出版日期 | 2021 |
卷号 | 21期号:3页码:1-20 |
关键词 | permutation flowshop scheduling problem deep reinforcement learning actor-critic dynamic scheduling real-time scheduling new job arrival tardiness cost |
ISSN号 | 1424-8220 |
产权排序 | 1 |
英文摘要 | Dynamic scheduling problems have been receiving increasing attention in recent years due to their practical implications. To realize real-time and the intelligent decision-making of dynamic scheduling, we studied dynamic permutation flowshop scheduling problem (PFSP) with new job arrival using deep reinforcement learning (DRL). A system architecture for solving dynamic PFSP using DRL is proposed, and the mathematical model to minimize total tardiness cost is established. Additionally, the intelligent scheduling system based on DRL is modeled, with state features, actions, and reward designed. Moreover, the advantage actor-critic (A2C) algorithm is adapted to train the scheduling agent. The learning curve indicates that the scheduling agent learned to generate better solutions efficiently during training. Extensive experiments are carried out to compare the A2C-based scheduling agent with every single action, other DRL algorithms, and meta-heuristics. The results show the well performance of the A2C-based scheduling agent considering solution quality, CPU times, and generalization. Notably, the trained agent generates a scheduling action only in 2.16 ms on average, which is almost instantaneous and can be used for real-time scheduling. Our work can help to build a self-learning, real-time optimizing, and intelligent decision-making scheduling system. |
WOS关键词 | ITERATED GREEDY ALGORITHM ; WEIGHTED TARDINESS ; BATCH DELIVERY ; LOCAL SEARCH ; SHOP ; MAKESPAN ; TIME ; OPTIMIZATION ; MINIMIZATION ; JOBS |
资助项目 | National Natural Science Foundation of China[61803367] ; Natural Science Foundation of Liaoning Province[2019-MS-346] |
WOS研究方向 | Chemistry ; Engineering ; Instruments & Instrumentation |
语种 | 英语 |
WOS记录号 | WOS:000615486700001 |
资助机构 | National Natural Science Foundation of China (61803367) ; Natural Science Foundation of Liaoning Province (2019-MS-346) |
源URL | [http://ir.sia.cn/handle/173321/28306] ![]() |
专题 | 沈阳自动化研究所_装备制造技术研究室 |
通讯作者 | Xu ZG(徐志刚) |
作者单位 | 1.University of Chinese Academy of Sciences, Beijing 100049, China 2.Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China 3.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China |
推荐引用方式 GB/T 7714 | Yang SL,Xu ZG,Wang JY. Intelligent decision-making of scheduling for dynamic permutation flowshop via deep reinforcement learning[J]. Sensors (Switzerland),2021,21(3):1-20. |
APA | Yang SL,Xu ZG,&Wang JY.(2021).Intelligent decision-making of scheduling for dynamic permutation flowshop via deep reinforcement learning.Sensors (Switzerland),21(3),1-20. |
MLA | Yang SL,et al."Intelligent decision-making of scheduling for dynamic permutation flowshop via deep reinforcement learning".Sensors (Switzerland) 21.3(2021):1-20. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。