Adaptive Multi-Teacher Multi-level Knowledge Distillation(AMTML-KD)

Paper has been accepted by Neurocomputing 415(2020): 106–113.

Authors: Yuang Liu, Wei Zhang and Jun Wang.

Links: [ pdf ] [ code ]

Requirements

PyTorch >= 1.0.0
Jupyter
visdom

Introduction

Knowledge distillation (KD) is an effective learning paradigm for improving the performance of light-weight student networks by utilizing additional supervision knowledge distilled from teacher networks. Most pioneering studies either learn from only a single teacher in their distillation learning methods, neglecting the potential that a student can learn from multiple teachers simultaneously, or simply treat each teacher to be equally important, unable to reveal the different importance of teachers for specific examples. To bridge this gap, we propose a novel adaptive multi-teacher multi-level knowledge distillation learning framework (AMTML-KD), which consists two novel insights: (i) associating each teacher with a latent representation to adaptively learn instance-level teacher importance weights which are leveraged for acquiring integrated soft-targets (high-level knowledge) and (ii) enabling the intermediate-level hints (intermediate-level knowledge) to be gathered from multiple teachers by the proposed multi-group hint strategy. As such, a student model can learn multi-level knowledge from multiple teachers through AMTML-KD. Extensive results on publicly available datasets demonstrate the proposed learning framework ensures student to achieve improved performance than strong competitors.

Citation

@article{LIU2020106,
    title = {Adaptive multi-teacher multi-level knowledge distillation},
    author = {Yuang Liu and Wei Zhang and Jun Wang},
    journal = {Neurocomputing},
    volume = {415},
    pages = {106 -- 113},
    year = {2020},
    issn = {0925 -- 2312},
}

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
dataset		dataset
figures		figures
metric		metric
models		models
utils		utils
.gitignore		.gitignore
AT_KD.ipynb		AT_KD.ipynb
DML_2S.ipynb		DML_2S.ipynb
DML_3S.ipynb		DML_3S.ipynb
DML_3S_only.ipynb		DML_3S_only.ipynb
DML_5S_only.ipynb		DML_5S_only.ipynb
FitNet_distill_hook.ipynb		FitNet_distill_hook.ipynb
LICENSE		LICENSE
LR_adaptive_AT.ipynb		LR_adaptive_AT.ipynb
LR_adaptive_Hint.ipynb		LR_adaptive_Hint.ipynb
LR_adaptive_KD.ipynb		LR_adaptive_KD.ipynb
LR_adaptive_learning.ipynb		LR_adaptive_learning.ipynb
LR_adaptive_triplets.ipynb		LR_adaptive_triplets.ipynb
README.md		README.md
RKD.ipynb		RKD.ipynb
adaptive_FitNet_teacher-level.ipynb		adaptive_FitNet_teacher-level.ipynb
group_Hint_only.ipynb		group_Hint_only.ipynb
multi_teacher_avg_distill.ipynb		multi_teacher_avg_distill.ipynb
nets_training.ipynb		nets_training.ipynb
original_KD.ipynb		original_KD.ipynb
train_resnet.py		train_resnet.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Adaptive Multi-Teacher Multi-level Knowledge Distillation(AMTML-KD)

Requirements

Introduction

Citation

About

Releases

Packages

Languages

License

FLHonker/AMTML-KD-code

Folders and files

Latest commit

History

Repository files navigation

Adaptive Multi-Teacher Multi-level Knowledge Distillation(AMTML-KD)

Requirements

Introduction

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages