
The quantization of CNN/LSTM
============================

Ongoing work on quantizing weights, activations, and gradients to k-bit representations for model compression, and on using bit-wise operations to speed up inference at test time.
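As a concrete illustration, the sketch below shows what deterministic k-bit quantization and a 1-bit XNOR/popcount dot product typically look like. The exact scheme this repo uses is not documented in this README, so the DoReFa-style round-to-nearest formula, the function names (`quantize_k`, `xnor_dot`), and the bit-packing convention are assumptions.

```cpp
#include <cmath>
#include <cstdint>
#include <cstdio>

// Assumed scheme: deterministic k-bit quantization of a value in [0, 1]
// (DoReFa-style): snap x to the nearest of 2^k evenly spaced levels.
double quantize_k(double x, int k) {
  const double levels = (1 << k) - 1;
  return std::round(x * levels) / levels;
}

// 1-bit case: with weights/activations in {-1, +1} packed one per bit,
// a dot product reduces to XNOR plus popcount, the kind of bit-wise
// operation that speeds up the test-phase inference mentioned above.
int xnor_dot(uint64_t a, uint64_t b, int n) {
  uint64_t mask = (n < 64) ? ((1ULL << n) - 1) : ~0ULL;
  int matches = __builtin_popcountll(~(a ^ b) & mask);  // agreeing signs
  return 2 * matches - n;  // +1 per agreement, -1 per disagreement
}

int main() {
  std::printf("%f\n", quantize_k(0.37, 2));    // -> 0.333333 (4 levels for k=2)
  std::printf("%d\n", xnor_dot(0xB, 0x9, 4));  // -> 2
  return 0;
}
```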

Implemented tasks:

  • (1) A new layer for weight quantization
  • (2) A new layer for activation quantization
  • (3) CPU and GPU versions of the two layers above (see the sketch after this list)
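Since the layer code itself is not reproduced in this README, the following is a minimal stand-in sketch of what the CPU forward/backward passes of such a quantization layer typically look like; the function names, the [0, 1] clipping range, and the straight-through gradient estimator are assumptions. In Caffe proper these loops would live in a layer's `Forward_cpu`/`Backward_cpu`, and the GPU version would run the same element-wise arithmetic in a CUDA kernel.

```cpp
#include <algorithm>
#include <cmath>
#include <vector>

// Forward pass (hypothetical): clip each input to [0, 1], then snap it to
// one of 2^k evenly spaced levels.
void quant_forward_cpu(const std::vector<float>& bottom,
                       std::vector<float>& top, int k) {
  const float levels = static_cast<float>((1 << k) - 1);
  top.resize(bottom.size());
  for (size_t i = 0; i < bottom.size(); ++i) {
    float x = std::min(std::max(bottom[i], 0.0f), 1.0f);  // clip to [0, 1]
    top[i] = std::round(x * levels) / levels;             // k-bit quantize
  }
}

// Backward pass (hypothetical): rounding has zero gradient almost
// everywhere, so a straight-through estimator is the usual workaround,
// passing the top gradient through unchanged inside the clipping range.
void quant_backward_cpu(const std::vector<float>& bottom,
                        const std::vector<float>& top_diff,
                        std::vector<float>& bottom_diff) {
  bottom_diff.resize(bottom.size());
  for (size_t i = 0; i < bottom.size(); ++i) {
    bool in_range = bottom[i] >= 0.0f && bottom[i] <= 1.0f;
    bottom_diff[i] = in_range ? top_diff[i] : 0.0f;
  }
}
```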

The implementation is based on Caffe (https://github.com/BVLC/caffe).
