中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems

文献类型:期刊论文

作者Wu, Jingya1,2; Lu, Wenyan1; Yan, Guihai1; Li, Xiaowei1,2
刊名SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS
出版日期2022-09-01
卷号35页码:8
ISSN号2210-5379
关键词Accelerators Heterogeneous systems Bandwidth contention Hardware hazard PCIe
DOI10.1016/j.suscom.2022.100724
英文摘要Accelerators are widely used in specific domains ranging across deep learning, streaming computation and database query. To enable an accelerator, it has to be attached to a primary controller, typically, a CPU. This kind of CPU-accelerator heterogeneous systems is the mainstream of current computer systems. In CPU-accelerator heterogeneous systems, customized optimization for accelerators boosts performance and energy efficiency. However the bandwidth contention of CPU-accelerator interconnection and hardware hazard between multiple tasks on accelerators significantly bottleneck the designed performance. On one hand, limited interconnected bandwidth resource causes bandwidth contention in task offloading from CPU to accelerators. On the other hand, limited hardware resources on accelerators cause hardware hazard during task execution. To take fully advantage of designed computing power in CPU-accelerator heterogeneous system, it is necessary to mitigate these kinds of contention. However, it is hard for programmers and users to solve the contention because of the complexity of both computing tasks and system behaviors. In state-of-the-art, CPU-GPU heterogeneous system have been sufficiently studied. But CPU-FPGA heterogeneous systems are seldom comprehensive analyzed. To help to address this problem, we propose a holistic profiling system, Portrait, to help to model both computation and bandwidth resource in CPU-accelerator heterogeneous system and quantify bandwidth requirement and execution time of given tasks. The experiment shows that Portrait increases the accuracy of the bandwidth requirement up to 97.71% on average, which is 1.95x compared with the state-of-the-art. It also provides a more accurate of computation latency compared to the state-of-the-art that failed to evaluate accelerator behaviors. And it increases accuracy of task execution latency over 97.47% on average. Additionally, based on precise profiling of CPU-accelerator heterogeneous system, Portrait could help task scheduling to mitigate bandwidth contention and hardware hazard more effectively to improve system throughput.
资助项目National Natural Science Foundation of China[61532017] ; National Natural Science Foundation of China[61572470] ; National Natural Science Foundation of China[61432017] ; National Natural Science Foundation of China[61521092] ; National Natural Science Foundation of China[61376043] ; Youth Innovation Promotion Association, CAS[Y404441000]
WOS研究方向Computer Science
语种英语
出版者ELSEVIER
WOS记录号WOS:000788100700005
源URL[http://119.78.100.204/handle/2XEOYT63/19634]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Yan, Guihai; Li, Xiaowei
作者单位1.Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing, Peoples R China
2.Univ Chinese Acad Sci, Beijing, Peoples R China
推荐引用方式
GB/T 7714
Wu, Jingya,Lu, Wenyan,Yan, Guihai,et al. Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems[J]. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS,2022,35:8.
APA Wu, Jingya,Lu, Wenyan,Yan, Guihai,&Li, Xiaowei.(2022).Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems.SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS,35,8.
MLA Wu, Jingya,et al."Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems".SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS 35(2022):8.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。