Model Compression: Theory, Practice, and Beyond

Published:

Background

Knowledge Distillation

Model Pruning

Quantization

Tensor Decomposition

Training-free Compression