Indexed by:
Abstract:
Binary neural networks (BNNs) are promising on resource-constrained devices because they reduce memory consumption and accelerate inference effectively. However, there is still room for improving their performance. Prior studies attribute the performance degradation of BNNs to limited representation ability and gradient mismatch. In this paper, we find that it also results from the mandatory representation of small full-precision auxiliary weights as large values. To tackle this issue, we propose an approach dubbed Diluted Binary Neural Network (DBNN). Besides avoiding mandatory representation effectively, the proposed DBNN also alleviates the sign flip problem to a large extent. For activations, we jointly minimize quantization error and maximize information entropy to develop the binarization scheme. Compared with existing sparsity-binarization approaches, DBNN trains the network from scratch without additional procedures and achieves greater sparsity. Experiments on several datasets with various networks demonstrate the superiority of our approach. (c) 2023 Elsevier Ltd. All rights reserved.
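For context, the abstract refers to the standard BNN setup in which full-precision auxiliary weights are binarized by a sign function and trained with a straight-through estimator (STE). The sketch below is a minimal, generic PyTorch illustration of that baseline only, not the paper's DBNN method; the names `BinarizeSTE` and `BinaryLinear` are hypothetical.

```python
# Generic sketch of sign binarization with a straight-through estimator
# (STE) -- the common BNN building block the paper starts from, NOT the
# DBNN method itself. All names here are illustrative assumptions.
import torch
import torch.nn as nn

class BinarizeSTE(torch.autograd.Function):
    @staticmethod
    def forward(ctx, x):
        ctx.save_for_backward(x)
        # torch.sign maps 0 to 0; practical BNNs often define sign(0) = +1.
        return torch.sign(x)

    @staticmethod
    def backward(ctx, grad_output):
        (x,) = ctx.saved_tensors
        # STE: pass the gradient through where |x| <= 1, block it elsewhere.
        return grad_output * (x.abs() <= 1).to(grad_output.dtype)

class BinaryLinear(nn.Module):
    """Linear layer with binarized weights and binarized input activations."""
    def __init__(self, in_features, out_features):
        super().__init__()
        # Full-precision "auxiliary" weights; gradients accumulate here.
        self.weight = nn.Parameter(torch.randn(out_features, in_features) * 0.01)

    def forward(self, x):
        bw = BinarizeSTE.apply(self.weight)  # binarize auxiliary weights
        bx = BinarizeSTE.apply(x)            # binarize incoming activations
        return bx @ bw.t()

# Usage: the backward pass updates the full-precision auxiliary weights,
# which is where the "mandatory representation" issue the paper studies arises.
layer = BinaryLinear(8, 4)
out = layer(torch.randn(2, 8))
out.sum().backward()
```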
Keywords:
Corresponding author:
Source:
PATTERN RECOGNITION
ISSN: 0031-3203
Year: 2023
Volume: 140
Impact Factor: 8.000 (JCR@2022)
ESI Subject: ENGINEERING
ESI Highly Cited Threshold: 19
Affiliated Department: