Post to Tumblr - Preview

Optimizing TensorFlow models with Quantization Techniques - Drops of AI

Quantized model would be around 4x smaller and hardware support computations is typically 2-4 times faster with INT8. Quantization techniques

Kartik Chaudhary