中国科学院机构知识库网格
Chinese Academy of Sciences Institutional Repositories Grid
A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation

文献类型:期刊论文

作者Jiang, Tao2,3; Xing, Ligang1; Yu, Jinming1; Qian, Junchao1,2,3
刊名JOURNAL OF REAL-TIME IMAGE PROCESSING
出版日期2024-08-01
卷号21
关键词Convolution neural networks CNN quantization Hardware accelerator FPGA
ISSN号1861-8200
DOI10.1007/s11554-024-01484-y
通讯作者Yu, Jinming(sdyujinming@163.com) ; Qian, Junchao(qianjunchao@hmfl.ac.cn)
英文摘要Convolutional Neural Networks (CNNs) have been widely used in various fields due to their high accuracy and efficiency. The performance of CNNs is mainly affected by the computing capability, memory bandwidth, and flexibility of embedded devices. The high energy efficiency, computing capability, and reconfigurability of FPGAs make it a good platform for hardware acceleration in the design of CNNs. However, the increase of complexity of CNNs, requires memory while the FPGA on-chip storage is limited. Therefore, we use an improved logarithmic quantization to compress the model. This approach allows for significant reduction in bit widths while maintaining high accuracy levels, making it an effective compression method. In this work, a hardware-friendly quantization scheme is proposed, in which the weights use improved logarithmic quantization scheme, and the quantization scheme of activations use the fixed-point-to-logarithmic. The results show that the quantization model has negligible Top-1/5 accuracy loss without any retraining. In addition, we implement an acceleration engine for a heterogeneous Generalized Matrix Multiplication (GEMM) core on Zynq XC7Z020. In GEMM, the multiplier is replaced by logic shifters and adders, which achieves efficient utilization of LUT resources. We use the optimal quantization model on Zynq XC7Z020. The throughput reaches 69.7 GOPs with a power consumption of 6.008W, and the resource efficiency is 8.713 GOPs/DSP or 5.564 GOPs/kLUTs.
资助项目National Natural Science Foundation of China
WOS研究方向Computer Science ; Engineering ; Imaging Science & Photographic Technology
语种英语
WOS记录号WOS:001243615300001
出版者SPRINGER HEIDELBERG
资助机构National Natural Science Foundation of China
源URL[http://ir.hfcas.ac.cn:8080/handle/334002/136218]  
专题中国科学院合肥物质科学研究院
通讯作者Yu, Jinming; Qian, Junchao
作者单位1.Shandong First Med Univ & Shandong Acad Med Sci, Shandong Univ, Shandong Canc Hosp & Inst, Dept Radiat Oncol,Sch Med, Jinan 250117, Peoples R China
2.Chinese Acad Sci, Hefei Canc Hosp, Inst Hlth & Med Technol, Hefei Inst Phys Sci,Anhui Prov Key Lab Med Phys &, Hefei 230031, Peoples R China
3.Anhui Jianzhu Univ, Sch Elect & Informat Engn, Dept Elect Sci & Technol, Hefei 230601, Peoples R China
推荐引用方式
GB/T 7714
Jiang, Tao,Xing, Ligang,Yu, Jinming,et al. A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation[J]. JOURNAL OF REAL-TIME IMAGE PROCESSING,2024,21.
APA Jiang, Tao,Xing, Ligang,Yu, Jinming,&Qian, Junchao.(2024).A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation.JOURNAL OF REAL-TIME IMAGE PROCESSING,21.
MLA Jiang, Tao,et al."A hardware-friendly logarithmic quantization method for CNNs and FPGA implementation".JOURNAL OF REAL-TIME IMAGE PROCESSING 21(2024).

入库方式: OAI收割

来源:合肥物质科学研究院

浏览0
下载0
收藏0
其他版本

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。