中国科学院机构知识库网格系统: Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks

Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks

文献类型：期刊论文


作者	Liu XY(刘晓宇)1,2,3,4 ; Xu C(许驰)1,3,4 ; Yu HB(于海斌)1,3,4 ; Zeng P(曾鹏)1,3,4
刊名	Frontiers of Information Technology & Electronic Engineering
出版日期	2022
卷号	23 期号:1 页码:47-60
关键词	Multi-agent deep reinforcement learning End–edge orchestrated Industrial wireless networks Delay Energy consumption
ISSN号	2095-9184
其他题名	基于多智能体深度强化学习的工业无线网络端边协同资源分配
产权排序	1
英文摘要	Edge artificial intelligence will empower the ever simple industrial wireless networks (IWNs) supporting complex and dynamic tasks by collaboratively exploiting the computation and communication resources of both machine-type devices (MTDs) and edge servers. In this paper, we propose a multi-agent deep reinforcement learning based resource allocation (MADRL-RA) algorithm for end–edge orchestrated IWNs to support computation-intensive and delay-sensitive applications. First, we present the system model of IWNs, wherein each MTD is regarded as a self-learning agent. Then, we apply the Markov decision process to formulate a minimum system overhead problem with joint optimization of delay and energy consumption. Next, we employ MADRL to defeat the explosive state space and learn an effective resource allocation policy with respect to computing decision, computation capacity, and transmission power. To break the time correlation of training data while accelerating the learning process of MADRL-RA, we design a weighted experience replay to store and sample experiences categorically. Furthermore, we propose a step-by-step ε-greedy method to balance exploitation and exploration. Finally, we verify the effectiveness of MADRL-RA by comparing it with some benchmark algorithms in many experiments, showing that MADRL-RA converges quickly and learns an effective resource allocation policy achieving the minimum system overhead.
WOS关键词	JOINT OPTIMIZATION ; INTERNET
资助项目	National Key R&D Program of China[2020YFB1710900] ; National Natural Science Foundation of China[62173322] ; National Natural Science Foundation of China[61803368] ; National Natural Science Foundation of China[U1908212] ; China Postdoctoral Science Foundation[2019M661156] ; Youth Innovation Promotion Association, Chinese Academy of Sciences[2019202]
WOS研究方向	Computer Science ; Engineering
语种	英语
WOS记录号	WOS:000751999800005
资助机构	National Key R&D Program of China (No. 2020YFB1710900) ; National Natural Science Foundation of China (Nos. 62173322, 61803368, and U1908212) ; China Postdoctoral Science Foundation (No. 2019M661156) ; Youth Innovation Promotion Association, Chinese Academy of Sciences (No. 2019202)
源URL	[http://ir.sia.cn/handle/173321/30291]
专题	沈阳自动化研究所_工业控制网络与系统研究室
通讯作者	Xu C(许驰); Yu HB(于海斌)
作者单位	1.State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China 2.University of Chinese Academy of Sciences, Beijing 100049, China 3.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China 4.Key Laboratory of Networked Control Systems, Chinese Academy of Sciences, Shenyang 110016, China
推荐引用方式 GB/T 7714	Liu XY,Xu C,Yu HB,et al. Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks[J]. Frontiers of Information Technology & Electronic Engineering,2022,23(1):47-60.
APA	Liu XY,Xu C,Yu HB,&Zeng P.(2022).Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks.Frontiers of Information Technology & Electronic Engineering,23(1),47-60.
MLA	Liu XY,et al."Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks".Frontiers of Information Technology & Electronic Engineering 23.1(2022):47-60.

入库方式： OAI收割

来源：沈阳自动化研究所

下载0

Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks

其他版本