by elephantum 1734 days ago
Do the same optimizations apply to TensorFlow / TensorFlow Serving?
1 comment

TensorFlow doesn't support quantized inference. It only supports mimicking quantization in floating point, which is used for quantization-aware training, so it can't immediately benefit from these optimizations.
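
To illustrate the distinction, here is a rough sketch of "mimicking quantization in floating point" versus producing actual integer codes. It assumes a simple uniform 8-bit scheme over a fixed range; the names `fake_quantize` and `quantize_to_int` are made up for this example and are not TensorFlow APIs.

```python
def quantize_to_int(x, x_min, x_max, num_bits=8):
    """Map x to an integer code in [0, 2**num_bits - 1].

    Quantized inference would run its arithmetic on codes like this.
    """
    levels = 2 ** num_bits - 1
    scale = (x_max - x_min) / levels
    clamped = min(max(x, x_min), x_max)      # saturate to the representable range
    return round((clamped - x_min) / scale)  # nearest integer code


def fake_quantize(x, x_min, x_max, num_bits=8):
    """Snap x to the same grid but hand back a float.

    The value carries the rounding error of quantization, yet every later
    operation still runs in floating point.
    """
    levels = 2 ** num_bits - 1
    scale = (x_max - x_min) / levels
    code = quantize_to_int(x, x_min, x_max, num_bits)
    return x_min + code * scale              # dequantize straight away


if __name__ == "__main__":
    x = 0.3037
    print(quantize_to_int(x, 0.0, 1.0))  # an int
    print(fake_quantize(x, 0.0, 1.0))    # a float near x
```

The second function is enough to expose the model to quantization error during training, but since its output is still a float, integer-optimized kernels have nothing to work with.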