Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks
文献类型:期刊论文
作者 | Liu XY(刘晓宇)1,2,3,4![]() ![]() ![]() ![]() |
刊名 | Frontiers of Information Technology & Electronic Engineering
![]() |
出版日期 | 2022 |
卷号 | 23期号:1页码:47-60 |
关键词 | Multi-agent deep reinforcement learning End–edge orchestrated Industrial wireless networks Delay Energy consumption |
ISSN号 | 2095-9184 |
其他题名 | 基于多智能体深度强化学习的工业无线网络端边协同资源分配 |
产权排序 | 1 |
英文摘要 | Edge artificial intelligence will empower the ever simple industrial wireless networks (IWNs) supporting complex and dynamic tasks by collaboratively exploiting the computation and communication resources of both machine-type devices (MTDs) and edge servers. In this paper, we propose a multi-agent deep reinforcement learning based resource allocation (MADRL-RA) algorithm for end–edge orchestrated IWNs to support computation-intensive and delay-sensitive applications. First, we present the system model of IWNs, wherein each MTD is regarded as a self-learning agent. Then, we apply the Markov decision process to formulate a minimum system overhead problem with joint optimization of delay and energy consumption. Next, we employ MADRL to defeat the explosive state space and learn an effective resource allocation policy with respect to computing decision, computation capacity, and transmission power. To break the time correlation of training data while accelerating the learning process of MADRL-RA, we design a weighted experience replay to store and sample experiences categorically. Furthermore, we propose a step-by-step ε-greedy method to balance exploitation and exploration. Finally, we verify the effectiveness of MADRL-RA by comparing it with some benchmark algorithms in many experiments, showing that MADRL-RA converges quickly and learns an effective resource allocation policy achieving the minimum system overhead. |
WOS关键词 | JOINT OPTIMIZATION ; INTERNET |
资助项目 | National Key R&D Program of China[2020YFB1710900] ; National Natural Science Foundation of China[62173322] ; National Natural Science Foundation of China[61803368] ; National Natural Science Foundation of China[U1908212] ; China Postdoctoral Science Foundation[2019M661156] ; Youth Innovation Promotion Association, Chinese Academy of Sciences[2019202] |
WOS研究方向 | Computer Science ; Engineering |
语种 | 英语 |
WOS记录号 | WOS:000751999800005 |
资助机构 | National Key R&D Program of China (No. 2020YFB1710900) ; National Natural Science Foundation of China (Nos. 62173322, 61803368, and U1908212) ; China Postdoctoral Science Foundation (No. 2019M661156) ; Youth Innovation Promotion Association, Chinese Academy of Sciences (No. 2019202) |
源URL | [http://ir.sia.cn/handle/173321/30291] ![]() |
专题 | 沈阳自动化研究所_工业控制网络与系统研究室 |
通讯作者 | Xu C(许驰); Yu HB(于海斌) |
作者单位 | 1.State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China 2.University of Chinese Academy of Sciences, Beijing 100049, China 3.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China 4.Key Laboratory of Networked Control Systems, Chinese Academy of Sciences, Shenyang 110016, China |
推荐引用方式 GB/T 7714 | Liu XY,Xu C,Yu HB,et al. Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks[J]. Frontiers of Information Technology & Electronic Engineering,2022,23(1):47-60. |
APA | Liu XY,Xu C,Yu HB,&Zeng P.(2022).Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks.Frontiers of Information Technology & Electronic Engineering,23(1),47-60. |
MLA | Liu XY,et al."Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks".Frontiers of Information Technology & Electronic Engineering 23.1(2022):47-60. |
入库方式: OAI收割
来源:沈阳自动化研究所
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。