Deep Compression

UIUC CS521, Research Project Exploring Deep Compression for DNNs

Overview

Deep Compression is a pipeline for reducing the size of deep neural nets using a combination of pruning, quantization, and encoding, originally described by Han, Mao, and Dally.

Goals

Reproduce the paper's results, using the PyTorch framework

For AlexNet
For VGG-16

Explore the following pruning techniques

L1 structured pruning
TBD

Explore the following quantization techniques

Incremental network quantization
TBD

Development

ImageNet

The alexnet.py script expects that the ImageNet dataset hosted on Kaggle is available locally. The annotations and data are in separate directories, which means they'll need to be zipped together for validating the model. The data loader expects the locations of those two directories are available in the environment. They can be populated in .env as IMAGENET_ANNOTATIONS_DIR and IMAGENET_DATA_DIR, respectively.

Name		Name	Last commit message	Last commit date
Latest commit History 33 Commits
trained_models		trained_models
.DS_Store		.DS_Store
.env.example		.env.example
.gitignore		.gitignore
README.md		README.md
charts.py		charts.py
config.py		config.py
dataloaders.py		dataloaders.py
evaluator.py		evaluator.py
generate_tables.ipynb		generate_tables.ipynb
kaggle_imagenet.py		kaggle_imagenet.py
lenet_size_vs_accuracy_no_prune.png		lenet_size_vs_accuracy_no_prune.png
main.py		main.py
models.py		models.py
pruning_results.csv		pruning_results.csv
prunings.py		prunings.py
quantizations.py		quantizations.py
quanto_quantized_alexnet.py		quanto_quantized_alexnet.py
requirements.txt		requirements.txt
results.csv		results.csv
size_vs_accuracy.png		size_vs_accuracy.png
size_vs_accuracy_no_prune.png		size_vs_accuracy_no_prune.png
size_vs_accuracy_trimmed.png		size_vs_accuracy_trimmed.png

tninesling/deep-compression

Folders and files

Latest commit

History

Repository files navigation

Deep Compression

Overview

Goals

Development

ImageNet

About

Resources

Stars

Watchers

Forks

Languages