Most realistic datasets for computer vision tasks contain a large number of classes that are unevenly distributed in the label space and can often be clustered into categories, as in popular benchmark datasets such as ImageNet or CIFAR-100. Typical convolutional neural networks often fail to generalize well on such datasets, especially when the number of images per class is small. When one cannot afford huge networks that are impossible to deploy on small devices (due to both memory and time constraints), a natural idea is to train an ensemble of experts, each specialized on a subset of the dataset's classes. However, these expert networks tend to overfit severely. To address this issue, we propose to leverage knowledge distillation, recently introduced by Hinton \etal~\cite{darkknowledge}, to train these networks. This technique acts as a very strong regularizer and allows us to achieve good results on this type of dataset, with a significant speed-up (for both training and prediction) and memory gain.
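To make the mechanism concrete, the distillation objective follows the standard formulation of \cite{darkknowledge}: the student minimizes a weighted combination of the usual cross-entropy on hard labels and a cross-entropy against the teacher's temperature-softened outputs (the symbols $z_s$, $z_t$, $T$ and $\alpha$ below are our notation):
\begin{equation*}
\mathcal{L}(x, y) \;=\; (1 - \alpha)\, H\big(y,\, \sigma(z_s(x))\big) \;+\; \alpha\, T^2\, H\big(\sigma(z_t(x)/T),\, \sigma(z_s(x)/T)\big),
\end{equation*}
where $z_s$ and $z_t$ denote the student and teacher logits, $\sigma$ is the softmax function, $T > 1$ is the temperature, and $\alpha$ balances the two terms; the $T^2$ factor keeps the gradient magnitudes of the soft-target term comparable across temperatures.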
After introducing the theoretical foundations of knowledge distillation, we present the different components of the pipeline required for specialist networks, along with several ways of improving the results. We also report and discuss our experiments on CIFAR-100, a dataset whose classes exhibit a natural clustered structure.