Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems
文献类型:期刊论文
作者 | Wu, Jingya1,2; Lu, Wenyan1; Yan, Guihai1; Li, Xiaowei1,2 |
刊名 | SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS |
出版日期 | 2022-09-01 |
卷号 | 35页码:8 |
ISSN号 | 2210-5379 |
关键词 | Accelerators Heterogeneous systems Bandwidth contention Hardware hazard PCIe |
DOI | 10.1016/j.suscom.2022.100724 |
英文摘要 | Accelerators are widely used in specific domains ranging across deep learning, streaming computation and database query. To enable an accelerator, it has to be attached to a primary controller, typically, a CPU. This kind of CPU-accelerator heterogeneous systems is the mainstream of current computer systems. In CPU-accelerator heterogeneous systems, customized optimization for accelerators boosts performance and energy efficiency. However the bandwidth contention of CPU-accelerator interconnection and hardware hazard between multiple tasks on accelerators significantly bottleneck the designed performance. On one hand, limited interconnected bandwidth resource causes bandwidth contention in task offloading from CPU to accelerators. On the other hand, limited hardware resources on accelerators cause hardware hazard during task execution. To take fully advantage of designed computing power in CPU-accelerator heterogeneous system, it is necessary to mitigate these kinds of contention. However, it is hard for programmers and users to solve the contention because of the complexity of both computing tasks and system behaviors. In state-of-the-art, CPU-GPU heterogeneous system have been sufficiently studied. But CPU-FPGA heterogeneous systems are seldom comprehensive analyzed. To help to address this problem, we propose a holistic profiling system, Portrait, to help to model both computation and bandwidth resource in CPU-accelerator heterogeneous system and quantify bandwidth requirement and execution time of given tasks. The experiment shows that Portrait increases the accuracy of the bandwidth requirement up to 97.71% on average, which is 1.95x compared with the state-of-the-art. It also provides a more accurate of computation latency compared to the state-of-the-art that failed to evaluate accelerator behaviors. And it increases accuracy of task execution latency over 97.47% on average. Additionally, based on precise profiling of CPU-accelerator heterogeneous system, Portrait could help task scheduling to mitigate bandwidth contention and hardware hazard more effectively to improve system throughput. |
资助项目 | National Natural Science Foundation of China[61532017] ; National Natural Science Foundation of China[61572470] ; National Natural Science Foundation of China[61432017] ; National Natural Science Foundation of China[61521092] ; National Natural Science Foundation of China[61376043] ; Youth Innovation Promotion Association, CAS[Y404441000] |
WOS研究方向 | Computer Science |
语种 | 英语 |
出版者 | ELSEVIER |
WOS记录号 | WOS:000788100700005 |
源URL | [http://119.78.100.204/handle/2XEOYT63/19634] |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Yan, Guihai; Li, Xiaowei |
作者单位 | 1.Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing, Peoples R China 2.Univ Chinese Acad Sci, Beijing, Peoples R China |
推荐引用方式 GB/T 7714 | Wu, Jingya,Lu, Wenyan,Yan, Guihai,et al. Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems[J]. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS,2022,35:8. |
APA | Wu, Jingya,Lu, Wenyan,Yan, Guihai,&Li, Xiaowei.(2022).Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems.SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS,35,8. |
MLA | Wu, Jingya,et al."Portrait: A holistic computation and bandwidth balanced performance evaluation model for heterogeneous systems".SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS 35(2022):8. |
入库方式: OAI收割
来源:计算技术研究所
浏览0
下载0
收藏0
其他版本
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。