
	by Marat_Dukhan 1735 days ago
	To benefit from the optimizations in this blog post, the model needs to be quantized to 8-bit integers. However, XNNPACK also supports floating-point inference (including with FP16 weights); see https://blog.tensorflow.org/2020/07/accelerating-tensorflow-...

1 comment

Thanks!