Indexed in:
Abstract:
Neural network quantization has become an important research area. Deep networks that run with low-precision operations at inference time offer power and space advantages over high-precision alternatives while maintaining high accuracy. However, few quantization methods demonstrate this advantage on hardware platforms, because quantization algorithms are often designed without considering the actual hardware implementation. In this paper, we propose an efficient quantization method for hardware implementation, learnable-parameter soft-clipping fully integer quantization (LSFQ), which quantizes both weights and activations using learnable clipping parameters. The quantization parameters are optimized automatically by back-propagation to minimize the loss; the BatchNorm layers are then fused into the convolutional layers, and the biases and quantization step sizes are further quantized. In this way, LSFQ achieves integer-only arithmetic. We evaluate the quantization algorithm on a variety of models, including VGG7 and MobileNet-V2, on CIFAR-10 and CIFAR-100. The results show that at 3-bit or 4-bit quantization, the accuracy loss of our method is less than 1% compared with the full-precision network. In addition, we design an accelerator for the quantization algorithm and deploy it on an FPGA platform to verify the hardware awareness of our method. © 2021 IEEE.
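The abstract describes quantizing weights and activations with clipping bounds that are learned by back-propagation. As a rough illustration only (not the paper's actual LSFQ formulation, which uses soft clipping and also quantizes biases and step sizes), the following is a minimal PyTorch-style sketch of a uniform quantizer whose clipping bound is a trainable parameter; the class name `LearnableClipQuantizer`, the parameter `alpha`, and the straight-through rounding estimator are assumptions introduced here for illustration.

```python
import torch
import torch.nn as nn

class LearnableClipQuantizer(nn.Module):
    """Uniform activation quantizer with a learnable clipping bound.

    Illustrative sketch: the clipping bound `alpha` is a trainable parameter,
    so the clipping range is optimized by back-propagation together with the
    network weights, as the abstract describes for LSFQ's clipping parameters.
    """

    def __init__(self, bits: int = 4, init_alpha: float = 3.0):
        super().__init__()
        self.levels = 2 ** bits - 1                           # number of quantization steps
        self.alpha = nn.Parameter(torch.tensor(init_alpha))   # learnable clipping bound

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # Clip to [0, alpha]; torch.minimum keeps the gradient path to alpha.
        x_c = torch.minimum(torch.relu(x), self.alpha)
        scale = self.alpha / self.levels                       # quantization step size
        q = torch.round(x_c / scale) * scale                   # uniform quantization
        # Straight-through estimator: quantized value in the forward pass,
        # identity gradient in the backward pass.
        return x_c + (q - x_c).detach()

if __name__ == "__main__":
    quant = LearnableClipQuantizer(bits=4)
    x = torch.randn(8, 16) * 3.0
    y = quant(x)
    y.sum().backward()
    print(quant.alpha.grad)  # the clipping bound receives a gradient from the loss
```

In a full integer-only pipeline as outlined in the abstract, such a quantizer would be combined with BatchNorm-convolution fusion and quantization of the biases and step sizes, so that inference uses integer arithmetic throughout; those steps are not shown in this sketch.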
Keywords:
Corresponding author information:
Email address: