Hacker News new | ask | show | jobs
by vomjom 3051 days ago
Keep in mind that what you linked refers to TPUv1, which is built for quantized 8-bit inference. The TPUv2, which was announced in this blog post, is for general purpose training and uses 32-bit weights, activations, and gradients.

It will have very different performance characteristics.

1 comments

Thanks for pointing that out!