
Authors:

Shi, Yong | Tang, Anda | Niu, Lingfeng | Zhou, Ruizhi

Indexed in:

EI | Scopus | SCIE

Abstract:

Neural network pruning is a critical field aimed at reducing the infrastructure costs of neural networks by removing parameters. Traditional methods follow a fixed paradigm of pretraining, pruning, and fine-tuning. Despite the close relationship among these three stages, most pruning methods treat them as independent processes. In this paper, we propose a novel two-stage pruning method, which comprises pretraining a network that is instructive for subsequent pruning, and a unified optimization model that integrates pruning and fine-tuning. Specifically, in the first stage, we design a group sparse regularized model for pretraining. This model not only safeguards the network from irreversible damage but also offers valuable guidance for the pruning process. In the second stage, we introduce an element-wise sparse regularization into the pruning model. This model enables us to pinpoint sparse weights more precisely than the pretrained network, automatically derives effective pruning criteria, and omits the fine-tuning step. To implement the two-stage process in practice, we employ a stochastic gradient algorithm for pretraining and design a threshold algorithm for the pruning stage. Extensive experiments confirm the competitive performance of the proposed method in terms of both accuracy and memory cost compared to various benchmarks. Furthermore, ablation experiments validate the effectiveness of the proposed pretraining model's guidance for the pruning process.
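
For intuition, the following is a minimal PyTorch sketch of the two-stage recipe described in the abstract: a group-lasso-style penalty added to the task loss during pretraining, followed by element-wise magnitude thresholding in place of a separate pruning and fine-tuning step. The output-channel grouping, the penalty weight `lam`, and the threshold `tau` are illustrative assumptions, not the paper's exact formulation.

```python
import torch
import torch.nn as nn

def group_sparse_penalty(model, lam=1e-4):
    # Group-lasso term: sum of L2 norms over groups of weights.
    # Grouping by output channel/neuron is an illustrative assumption;
    # the paper's grouping scheme may differ.
    penalty = torch.tensor(0.0)
    for m in model.modules():
        if isinstance(m, (nn.Conv2d, nn.Linear)):
            penalty = penalty + m.weight.flatten(1).norm(dim=1).sum()
    return lam * penalty

def threshold_prune(model, tau=1e-2):
    # Element-wise hard threshold: zero out weights with |w| < tau.
    # A simple stand-in for the paper's threshold algorithm.
    with torch.no_grad():
        for m in model.modules():
            if isinstance(m, (nn.Conv2d, nn.Linear)):
                m.weight.mul_((m.weight.abs() >= tau).to(m.weight.dtype))

# Stage 1: pretraining with the group-sparse term added to the task loss.
model = nn.Sequential(nn.Linear(784, 256), nn.ReLU(), nn.Linear(256, 10))
opt = torch.optim.SGD(model.parameters(), lr=0.1)
loss_fn = nn.CrossEntropyLoss()
x, y = torch.randn(256, 784), torch.randint(0, 10, (256,))  # toy data
for _ in range(200):
    opt.zero_grad()
    loss = loss_fn(model(x), y) + group_sparse_penalty(model)
    loss.backward()
    opt.step()

# Stage 2: element-wise thresholding replaces separate pruning + fine-tuning.
threshold_prune(model, tau=1e-2)
zeros = sum((m.weight == 0).sum().item()
            for m in model.modules() if isinstance(m, nn.Linear))
total = sum(m.weight.numel()
            for m in model.modules() if isinstance(m, nn.Linear))
print(f"weight sparsity after thresholding: {zeros / total:.2%}")
```

In this sketch the group penalty pushes whole output channels toward zero during pretraining, which is what makes a subsequent hard threshold comparatively safe; the paper's actual pruning criteria are derived automatically from its unified optimization model rather than from a hand-set `tau`.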

Keywords:

Deep neural networks; Model compression; Sparse optimization

Author affiliations:

  • [ 1 ] [Tang, Anda]Univ Chinese Acad Sci, Sch Math Sci, Beijing 100190, Peoples R China
  • [ 2 ] [Niu, Lingfeng]Univ Chinese Acad Sci, Sch Econ & Management, Beijing 100190, Peoples R China
  • [ 3 ] [Shi, Yong]Chinese Acad Sci, Res Ctr Fictitious Econ & Data Sci, Beijing 100190, Peoples R China
  • [ 4 ] [Shi, Yong]Chinese Acad Sci, Key Lab Big Data Min & Knowledge Management, Beijing 100190, Peoples R China
  • [ 5 ] [Zhou, Ruizhi]Beijing Univ Technol, Inst Operat Res & Informat Engn, Beijing 100124, Peoples R China

Corresponding author:

  • [Zhou, Ruizhi]Beijing Univ Technol, Inst Operat Res & Informat Engn, Beijing 100124, Peoples R China

Source:

NEUROCOMPUTING

ISSN: 0925-2312

Year: 2024

Volume: 574

Impact Factor: 6.000 (JCR@2022)

Citation counts:

WoS Core Collection citations:

Scopus citations: 8

ESI Highly Cited Paper listings: 0

