中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
DPU-Direct: Unleashing Remote Accelerators via Enhanced RDMA for Disaggregated Datacenters

文献类型:期刊论文

作者Liao, Yunkun1,2,3; Wu, Jingya1; Lu, Wenyan1,4; Li, Xiaowei3; Yan, Guihai1,4
刊名IEEE TRANSACTIONS ON COMPUTERS
出版日期2024-08-01
卷号73期号:8页码:2081-2095
关键词Central Processing Unit Engines Jitter Computers Pipelines Programming Encryption Disaggregated datacenter SmartNIC RDMA hardware accelerator
ISSN号0018-9340
DOI10.1109/TC.2024.3404089
英文摘要This paper presents DPU-Direct, an accelerator disaggregation system that connects accelerator nodes (ANs) and CPU nodes (CNs) over a standard Remote Direct Memory Access (RDMA) network. DPU-Direct eliminates the latency introduced by the CPU-based network stack, and PCIe interconnects between network I/O and the accelerator. The DPU-Direct system architecture includes a DPU Wrapper hardware architecture, an RDMA-based Accelerator Access Pattern (RAAP), and a CN-side programming model. The DPU Wrapper connects accelerators directly with the RDMA engine, turning ANs into disaggregation-native devices. The RAAP provides the CN with low-latency and high throughput accelerator semantics based on standard RDMA operations. Our FPGA prototype demonstrates DPU-Direct's efficacy with two proof-of-concept applications: AES encryption and key-value cache, which are computationally intensive and latency-sensitive. DPU-Direct yields a 400x speedup in AES encryption over the CPU baseline and matches the performance of the locally integrated AES accelerator. For key-value cache, DPU-Direct reduces the average end-to-end latency by 1.66x for GETs and 1.30x for SETs over the CPU-RDMA-Polling baseline, reducing latency jitter by over 10x for both operations.
资助项目National Natural Science Foundation of China (NSFC)[62002340] ; National Natural Science Foundation of China (NSFC)[62090020] ; National Natural Science Foundation of China (NSFC)[61872336] ; Youth Innovation Promotion Association CAS[Y201923] ; Strategic Priority Research Program of the Chinese Academy of Sciences[XDB44030100] ; Internship program of YUSUR Technology Co., Ltd.
WOS研究方向Computer Science ; Engineering
语种英语
WOS记录号WOS:001270596400013
出版者IEEE COMPUTER SOC
源URL[http://119.78.100.204/handle/2XEOYT63/39838]  
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Wu, Jingya; Yan, Guihai
作者单位1.Chinese Acad Sci, Inst Comp Technol, SKLP, Beijing 100190, Peoples R China
2.Univ Chinese Acad Sci, Beijing 100190, Peoples R China
3.Zhongguancun Lab, Beijing 100190, Peoples R China
4.YUSUR Tech Co Ltd, Beijing 100190, Peoples R China
推荐引用方式
GB/T 7714
Liao, Yunkun,Wu, Jingya,Lu, Wenyan,et al. DPU-Direct: Unleashing Remote Accelerators via Enhanced RDMA for Disaggregated Datacenters[J]. IEEE TRANSACTIONS ON COMPUTERS,2024,73(8):2081-2095.
APA Liao, Yunkun,Wu, Jingya,Lu, Wenyan,Li, Xiaowei,&Yan, Guihai.(2024).DPU-Direct: Unleashing Remote Accelerators via Enhanced RDMA for Disaggregated Datacenters.IEEE TRANSACTIONS ON COMPUTERS,73(8),2081-2095.
MLA Liao, Yunkun,et al."DPU-Direct: Unleashing Remote Accelerators via Enhanced RDMA for Disaggregated Datacenters".IEEE TRANSACTIONS ON COMPUTERS 73.8(2024):2081-2095.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。