LSFQ: A Low-Bit Full Integer Quantization for High-Performance FPGA-Based CNN Acceleration - Details

Author：

Bao, Zhenshan (Bao, Zhenshan.) | Fu, Guohang (Fu, Guohang.) | Zhang, Wenbo (Zhang, Wenbo.) | Zhan, Kang (Zhan, Kang.) | Guo, Junnan (Guo, Junnan.)

Indexed by：

EI Scopus SCIE

Abstract：

The　effective　implementation　of　quantization　depends　not　only　on　the　specific　task　but　also　on　the　hardware　resources.　This　article　presents　a　hardware-aware　customized　quantization　method　for　convolutional　neural　networks.　We　propose　a　learnable　parameter　soft　clipping　full　integer　quantization　(LSFQ),　which　includes　weight　and　activation　quantization　with　the　learnable　clipping　parameters.　Moreover,　the　LSFQ　accelerator　architecture　is　customized　on　the　field-programmable　gate　array　(FPGA)　platform　to　verify　the　hardware　awareness　of　our　method,　in　which　DSP48E2　is　designed　to　realize　the　parallel　computation　of　six　low-bit　integer　multiplications.　The　results　showed　that　the　accuracy　loss　of　LSFQ　is　less　than　1%　compared　with　the　full-precision　models　including　VGG7,　mobile-net　v2　in　CIFAR10,　and　CIFAR100.　An　LSFQ　accelerator　was　demonstrated　at　the　57th　IEEE/ACM　Design　Automation　Conference　System　Design　Contest　(DAC-SDC)　and　won　the　championship　at　the　FPGA　track.

Keyword：

Accelerator architectures Design automation Computer architecture Quantization (signal) Convolutional neural networks Field programmable gate arrays Training data Neural networks

Author Community：

[ 1 ] [Bao, Zhenshan]Beijing Univ Technol, Beijing, Peoples R China
[ 2 ] [Fu, Guohang]Beijing Univ Technol, Beijing, Peoples R China
[ 3 ] [Zhang, Wenbo]Beijing Univ Technol, Beijing, Peoples R China
[ 4 ] [Zhan, Kang]Beijing Univ Technol, Beijing, Peoples R China
[ 5 ] [Guo, Junnan]Beijing Univ Technol, Beijing, Peoples R China

Reprint Author's Address：

Email：

baozhenshan@bjut.edu.cn |
fuguohang@emails.bjut.edu.cn |
zhangwenbo@bjut.edu.cn |
kangkang@emails.bjut.edu.cn |
guojn@emails.bjut.edu.cn

Show more details

Related Keywords：

Hardware-Friendly 3-D CNN Acceleration With Balanced Kernel Group Sparsity
2024，IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS
UltraAcc: A Customized Low Power and High Performance CNN Accelerator with Dtaflow on FPGAs
2023，Chinese Journal of Computers
Hardware Implementation of On-Chip Learning Neural Network on FPGA
2023，42nd Chinese Control Conference, CCC 2023
Shortcut Convolutional Neural Networks for Classification of Gender and Texture
2017，26th International Conference on Artificial Neural Networks (ICANN)

Source ：

IEEE MICRO

ISSN： 0272-1732

Year： 2022

Issue： 2

Volume： 42

Page： 8-15

3 . 6

JCR@2022

3 . 6 0 0

JCR@2022

ESI Discipline： COMPUTER SCIENCE;

ESI HC Threshold：46

JCR Journal Grade：2

CAS Journal Grade：3

Cited Count：

WoS CC Cited Count： 5

SCOPUS Cited Count： 8

ESI Highly Cited Papers on the List： 0 Unfold All

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 3

Affiliated Colleges：

Get Fulltext

DOI Library Discovery Baidu Scholar Search Web of Science

Type
Departments

All Years Choose Year From to