Chinese Academy of Sciences Institutional Repositories Grid
Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster

Document type: Conference paper

Authors: Wang Lei; Zhang Yunquan; Zhang Xianyi; Liu Fangfang
Publication date: 2010
Conference name: 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010
Conference date: 37436
Conference location: Bradford, United Kingdom
Keywords: Embedded software; Embedded systems; Information technology; Linear systems; Program processors
Pages: 1169-1174
Abstract: This paper introduces a mixed precision algorithm for solving linear systems of equations and the implementation of the HPL package. We use the mixed precision algorithm to improve the HPL package on CPU+GPGPU heterogeneous clusters, name the result GHPL, and describe its implementation mechanisms in detail. Experimental results are measured on multi-core CPU platforms and on CPU+GPGPU heterogeneous clusters. The results show that GHPL scales well in all the experimental environments and sustains more than 1.7 Teraflops both on a 16-node cluster containing 32 NVIDIA Tesla C1060 GPUs and on an 8-node cluster containing 32 NVIDIA GeForce GTX 295 GPUs, with average speedups over HPL of 3.06 and 2.40, respectively. © 2010 IEEE.
Indexed by: EI
Conference sponsors: University of Bradford; IEEE; IEEE Computer Society; IEEE TCSC; IEEE Industry Applications Society (IAS)
Proceedings: Proceedings - 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, ScalCom-2010
Proceedings place of publication: United States
ISBN: 9780770000000
Source URL: [http://124.16.136.157/handle/311060/8642]
Collection: Institute of Software_Parallel Computing Laboratory_Conference papers
Recommended citation
GB/T 7714
Wang Lei, Zhang Yunquan, Zhang Xianyi, et al. Accelerating Linpack Performance with Mixed Precision Algorithm on CPU+GPGPU Heterogeneous Cluster[C]. In: 10th IEEE International Conference on Computer and Information Technology, CIT-2010, 7th IEEE International Conference on Embedded Software and Systems, ICESS-2010, 10th IEEE Int. Conf. Scalable Computing and Communications, ScalCom-2010. Bradford, United Kingdom. 37436.
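
The abstract names a mixed precision algorithm for solving linear systems but does not spell it out in this record. As a rough illustration only, the following is a minimal sketch of the general technique usually meant by that term (mixed precision iterative refinement), assuming NumPy: the expensive solves run in single precision, while residuals and solution updates are kept in double precision. It is not the GHPL implementation, and the function names `mixed_precision_solve` and `demo` are hypothetical.

```python
import numpy as np


def mixed_precision_solve(A, b, max_iters=10, tol=1e-12):
    """Solve A x = b with float32 solves and float64 residuals/updates.

    Illustrative sketch only; not the method from the paper.
    """
    A64 = np.asarray(A, dtype=np.float64)
    b64 = np.asarray(b, dtype=np.float64)
    A32 = A64.astype(np.float32)

    # Initial solve in single precision (the costly O(n^3) step).
    x = np.linalg.solve(A32, b64.astype(np.float32)).astype(np.float64)

    for _ in range(max_iters):
        # Residual computed in double precision.
        r = b64 - A64 @ x
        if np.linalg.norm(r) <= tol * np.linalg.norm(b64):
            break
        # Correction solved in single precision. For simplicity this
        # re-solves from scratch; a real code would reuse the LU factors.
        d = np.linalg.solve(A32, r.astype(np.float32))
        # Update accumulated in double precision.
        x += d.astype(np.float64)
    return x


def demo(n=200, seed=0):
    """Compare a plain float32 solve with the refined solve."""
    rng = np.random.default_rng(seed)
    # Diagonally dominant test matrix, assumed well conditioned.
    A = rng.standard_normal((n, n)) + n * np.eye(n)
    x_true = rng.standard_normal(n)
    b = A @ x_true

    x32 = np.linalg.solve(A.astype(np.float32), b.astype(np.float32))
    x_mp = mixed_precision_solve(A, b)

    scale = np.linalg.norm(x_true)
    err32 = np.linalg.norm(x32.astype(np.float64) - x_true) / scale
    err_mp = np.linalg.norm(x_mp - x_true) / scale
    return err32, err_mp


if __name__ == "__main__":
    err32, err_mp = demo()
    print("float32-only relative error:", err32)
    print("mixed precision relative error:", err_mp)
```

On a well-conditioned system such as the one in `demo`, the refined solution should be noticeably more accurate than the single precision solve alone, which is why the approach suits hardware where single precision arithmetic is much faster than double precision.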

Deposit method: OAI harvesting

Source: Institute of Software
