Learning From Teacher's Failure: A Reflective Learning Paradigm for Knowledge Distillation

Author：

Xu, Kai | Wang, Lichun (王立春) | Xin, Jianjia | Li, Shuang | Yin, Baocai

Indexed by：

EI Scopus SCIE

Abstract：

Knowledge distillation transfers knowledge learned by a teacher network to a student network. A common mode of knowledge transfer is to directly use the teacher network's experience for all samples, without differentiating whether the teacher's experience is successful or not. According to common sense, experience varies with its nature: successful experience is used for guidance, and failed experience is used for correction. Inspired by that, this paper analyzes the failure of the teacher and proposes a reflective learning paradigm, which additionally uses heuristic knowledge extracted from the teacher's failure besides following the authority of the teacher. Specifically, this paper defines Mutual Error Distance (MED) based on the teacher's wrong predictions. MED measures the adequacy of the decision boundary learned by the teacher, which concretizes the failure of the teacher. Then, this paper proposes DCGD (divide-and-conquer grouping distillation) to critically transfer the teacher's knowledge by grouping the target task into small-scale subtasks and designing multi-branch networks on the basis of MED. Finally, a switchable training mechanism is designed to integrate a regular student, which provides an option of a student network without added parameters compared with the multi-branch student network. Extensive experiments on three image classification benchmarks (CIFAR-10, CIFAR-100 and TinyImageNet) show the effectiveness of the proposed paradigm. Especially on the CIFAR-100 dataset, the average error of students using DCGD+DKD decreased by 4.28%. In addition, the experimental results show that the paradigm is also applicable to self-distillation.
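
As a rough illustration of the baseline the abstract contrasts against, the sketch below computes a vanilla temperature-scaled distillation loss and then separates samples by whether the teacher's top-1 prediction is right or wrong. It is not the paper's MED or DCGD, which this record does not define in enough detail to reproduce. The function names (`softmax`, `kd_loss`, `split_by_teacher_outcome`), the temperature value, and the toy logits are all assumptions made for illustration.

```python
import math

def softmax(logits, temperature=1.0):
    """Temperature-scaled softmax over a list of logits."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]

def kd_loss(student_logits, teacher_logits, temperature=4.0):
    """Vanilla KD term: KL(teacher || student) on softened outputs, scaled by T^2."""
    p = softmax(teacher_logits, temperature)
    q = softmax(student_logits, temperature)
    kl = sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)
    return temperature ** 2 * kl

def split_by_teacher_outcome(teacher_logits_batch, labels):
    """Return (correct, wrong) index lists based on the teacher's top-1 prediction."""
    correct, wrong = [], []
    for i, (logits, y) in enumerate(zip(teacher_logits_batch, labels)):
        pred = max(range(len(logits)), key=logits.__getitem__)
        (correct if pred == y else wrong).append(i)
    return correct, wrong

# Toy data (assumed): three samples, three classes.
teacher_batch = [[5.0, 1.0, 0.0], [0.5, 2.0, 1.5], [0.0, 0.2, 3.0]]
labels = [0, 2, 2]
correct_idx, wrong_idx = split_by_teacher_outcome(teacher_batch, labels)
```

A plain distillation setup would apply `kd_loss` to every sample alike. The split into `correct_idx` and `wrong_idx` only marks where the teacher failed, which is the kind of signal the abstract says the reflective paradigm builds on.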

Keyword：

decision boundary | Training | mutual error distance | divide-and-conquer | Knowledge engineering | Task analysis | Birds | Dogs | Marine vehicles | reflective learning paradigm | Automobiles | Knowledge distillation

Author Affiliations：

[ 1 ] [Xu, Kai]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, 100124, Peoples R China
[ 2 ] [Wang, Lichun]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, 100124, Peoples R China
[ 3 ] [Xin, Jianjia]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, 100124, Peoples R China
[ 4 ] [Li, Shuang]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, 100124, Peoples R China
[ 5 ] [Yin, Baocai]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, 100124, Peoples R China

Reprint Author's Address：

[Wang, Lichun]Beijing Univ Technol, Beijing Artificial Intelligence Inst, Fac Informat Technol, Beijing Key Lab Multimedia & Intelligent Software, Beijing, 100124, Peoples R China

Email：

xukai@emails.bjut.edu.cn |
wanglc@bjut.edu.cn |
xinjianjia@emails.bjut.edu.cn |
shuangli@emails.bjut.edu.cn |
ybc@bjut.edu.cn

Related Articles：

Self-Distillation With Augmentation in Feature Space
2024, IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY
Efficient Dual-Attention-Based Knowledge Distillation Network for Unsupervised Wafer Map Anomaly Detection
2024, IEEE TRANSACTIONS ON SEMICONDUCTOR MANUFACTURING
Self-supervised knowledge distillation for complementary label learning
2022, NEURAL NETWORKS
Using Distillation to Improve Network Performance after Pruning and Quantization
2019, 2nd International Conference on Machine Learning and Machine Intelligence (MLMI)

Source ：

IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY

ISSN： 1051-8215

Year： 2024

Issue： 1

Volume： 34

Page： 384-396

Impact Factor： 8.400 (JCR@2022)

Cited Count：

WoS CC Cited Count：

SCOPUS Cited Count： 9

ESI Highly Cited Papers on the List： 0

WanFang Cited Count：

Chinese Cited Count：

30 Days PV： 1

Affiliated Colleges：
