中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks

文献类型:期刊论文

作者Liu XY(刘晓宇)1,2,3,4; Xu C(许驰)1,3,4; Yu HB(于海斌)1,3,4; Zeng P(曾鹏)1,3,4
刊名Frontiers of Information Technology & Electronic Engineering
出版日期2022
卷号23期号:1页码:47-60
关键词Multi-agent deep reinforcement learning End–edge orchestrated Industrial wireless networks Delay Energy consumption
ISSN号2095-9184
其他题名基于多智能体深度强化学习的工业无线网络端边协同资源分配
产权排序1
英文摘要

Edge artificial intelligence will empower the ever simple industrial wireless networks (IWNs) supporting complex and dynamic tasks by collaboratively exploiting the computation and communication resources of both machine-type devices (MTDs) and edge servers. In this paper, we propose a multi-agent deep reinforcement learning based resource allocation (MADRL-RA) algorithm for end–edge orchestrated IWNs to support computation-intensive and delay-sensitive applications. First, we present the system model of IWNs, wherein each MTD is regarded as a self-learning agent. Then, we apply the Markov decision process to formulate a minimum system overhead problem with joint optimization of delay and energy consumption. Next, we employ MADRL to defeat the explosive state space and learn an effective resource allocation policy with respect to computing decision, computation capacity, and transmission power. To break the time correlation of training data while accelerating the learning process of MADRL-RA, we design a weighted experience replay to store and sample experiences categorically. Furthermore, we propose a step-by-step ε-greedy method to balance exploitation and exploration. Finally, we verify the effectiveness of MADRL-RA by comparing it with some benchmark algorithms in many experiments, showing that MADRL-RA converges quickly and learns an effective resource allocation policy achieving the minimum system overhead.

WOS关键词JOINT OPTIMIZATION ; INTERNET
资助项目National Key R&D Program of China[2020YFB1710900] ; National Natural Science Foundation of China[62173322] ; National Natural Science Foundation of China[61803368] ; National Natural Science Foundation of China[U1908212] ; China Postdoctoral Science Foundation[2019M661156] ; Youth Innovation Promotion Association, Chinese Academy of Sciences[2019202]
WOS研究方向Computer Science ; Engineering
语种英语
WOS记录号WOS:000751999800005
资助机构National Key R&D Program of China (No. 2020YFB1710900) ; National Natural Science Foundation of China (Nos. 62173322, 61803368, and U1908212) ; China Postdoctoral Science Foundation (No. 2019M661156) ; Youth Innovation Promotion Association, Chinese Academy of Sciences (No. 2019202)
源URL[http://ir.sia.cn/handle/173321/30291]  
专题沈阳自动化研究所_工业控制网络与系统研究室
通讯作者Xu C(许驰); Yu HB(于海斌)
作者单位1.State Key Laboratory of Robotics, Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang 110016, China
2.University of Chinese Academy of Sciences, Beijing 100049, China
3.Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang 110169, China
4.Key Laboratory of Networked Control Systems, Chinese Academy of Sciences, Shenyang 110016, China
推荐引用方式
GB/T 7714
Liu XY,Xu C,Yu HB,et al. Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks[J]. Frontiers of Information Technology & Electronic Engineering,2022,23(1):47-60.
APA Liu XY,Xu C,Yu HB,&Zeng P.(2022).Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks.Frontiers of Information Technology & Electronic Engineering,23(1),47-60.
MLA Liu XY,et al."Multi-agent deep reinforcement learning for end–edge orchestrated resource allocation in industrialwireless networks".Frontiers of Information Technology & Electronic Engineering 23.1(2022):47-60.

入库方式: OAI收割

来源:沈阳自动化研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。