Introduction
  • A brief introduction about deep learning
  • Model compression overview
  • Demo CNN quantization in Tensorflow
  • Model Size Estimation
Compression Algorithm: pruning
  • pruning overview
  • Pruning Overview
  • Hessian based pruning
  • Group Hessian pruning
  • Hessian pruning
  • Hessian Reuse 1
  • Hessian Reuse 2
Compression Algorithm: quantization
  • fixed point quantization
  • Fixed Point Quantization
Compression Algorithm: distillation
  • knowledge distillation cross entropy
  • KD Cross Entropy
Compression Algorithm: factorization
  • SVD factorization
  • SVD Factorization