中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
Accelerating deep neural network filter pruning with mask-aware convolutional computations on modern CPUs

文献类型:期刊论文

作者Ma, Xiu3,4; Li, Guangli1,2; Liu, Lei3,4; Liu, Huaxiao3,4; Wang, Xueying1,2
刊名NEUROCOMPUTING
出版日期2022-09-21
卷号505页码:375-387
关键词Deep learning systems Neural network compression Filter pruning
ISSN号0925-2312
DOI10.1016/j.neucom.2022.07.006
英文摘要Filter pruning, a representative model compression technique, has been widely used to compress and accelerate sophisticated deep neural networks on resource-constrained platforms. Nevertheless, most studies focus on reducing the cost of model inference, whereas the heavy burden of the pruning optimiza-tion process is neglected. In this paper, we propose MaskACC, a mask-aware convolutional computation method, which accelerates the prevailing mask-based filter pruning process on modern CPU platforms. MaskACC dynamically reorganizes the tensors used in convolutions with the mask information to avoid unnecessary computations, thereby improving the computational efficiency of the pruning process. Evaluation with state-of-the-art neural network models on CPU cloud platforms demonstrates the effec-tiveness of our method, which achieves up to 1.61x speedup under commonly-used pruning rates, com-pared to conventional computations. (c) 2022 Elsevier B.V. All rights reserved.
资助项目National Key R&D Program of China[2021ZD0110101] ; National Natural Science Foundation of China[61872043] ; CCF- Huawei Populus Grove Fund ; Fundamental Research Funds for the Central Universities
WOS研究方向Computer Science
语种英语
WOS记录号WOS:000861364900010
出版者ELSEVIER
源URL[http://119.78.100.204/handle/2XEOYT63/19806]  
专题中国科学院计算技术研究所期刊论文
通讯作者Li, Guangli
作者单位1.Univ Chinese Acad Sci, Beijing, Peoples R China
2.Chinese Acad Sci, Inst Comp Technol, State Key Lab Processors, Beijing, Peoples R China
3.Jilin Univ, Key Lab Symbol Computat & Knowledge Engn, Minist Educ, Changchun, Peoples R China
4.Jilin Univ, Coll Comp Sci & Technol, Changchun, Peoples R China
推荐引用方式
GB/T 7714
Ma, Xiu,Li, Guangli,Liu, Lei,et al. Accelerating deep neural network filter pruning with mask-aware convolutional computations on modern CPUs[J]. NEUROCOMPUTING,2022,505:375-387.
APA Ma, Xiu,Li, Guangli,Liu, Lei,Liu, Huaxiao,&Wang, Xueying.(2022).Accelerating deep neural network filter pruning with mask-aware convolutional computations on modern CPUs.NEUROCOMPUTING,505,375-387.
MLA Ma, Xiu,et al."Accelerating deep neural network filter pruning with mask-aware convolutional computations on modern CPUs".NEUROCOMPUTING 505(2022):375-387.

入库方式: OAI收割

来源:计算技术研究所

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。