Hacker News
by LeanderK 1496 days ago
> The neural engine is small and inference only

Why is it inference only? At least the operations are the same...just a bunch of linear algebra

1 comment

Inference is often done in fixed point (e.g. 8-bit integers), whereas training is (usually) done in floating point.
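To make the fixed-point part concrete, here is a rough sketch of symmetric 8-bit quantization followed by an integer dot product. It is only an illustration under my own assumptions: the function names (`quantize`, `int_dot`) and the example numbers are made up, and nothing here claims to describe what any particular neural engine actually does.

```python
def quantize(values, num_bits=8):
    """Symmetric linear quantization: floats -> signed integers plus one scale.

    Hypothetical helper for illustration only.
    """
    qmax = 2 ** (num_bits - 1) - 1          # 127 for 8 bits
    max_abs = max(abs(v) for v in values)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the representable range.
    q = [max(-qmax, min(qmax, round(v / scale))) for v in values]
    return q, scale


def int_dot(q_a, scale_a, q_b, scale_b):
    """Dot product done entirely in integers, rescaled to float once at the end."""
    acc = sum(a * b for a, b in zip(q_a, q_b))   # integer multiply-accumulate
    return acc * scale_a * scale_b


# Made-up example values.
weights = [0.5, -1.0, 0.25, 0.75]
activations = [1.0, 2.0, -0.5, 0.1]

q_w, s_w = quantize(weights)
q_x, s_x = quantize(activations)

approx = int_dot(q_w, s_w, q_x, s_x)
exact = sum(w * x for w, x in zip(weights, activations))
```

The quantized result is close to the float one but not identical. That small error is usually tolerable for a forward pass, while gradient updates during training tend to need the extra dynamic range of floating point.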

Inference also prefers different I/O patterns, because you don't need to keep the activations for every layer ready for backpropagation.
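A toy sketch of that difference, assuming a chain of scalar "layers" that each multiply by a weight and apply ReLU. The names (`forward_inference`, `forward_training`, `backward`) are hypothetical and the network is deliberately trivial; the point is only which values have to stay in memory.

```python
def forward_inference(weights, x):
    """Inference: each activation overwrites the previous one, nothing is kept."""
    for w in weights:
        x = max(0.0, w * x)      # ReLU(w * x)
    return x


def forward_training(weights, x):
    """Training: keep the input to every layer, because backprop reads them."""
    saved_inputs = []
    for w in weights:
        saved_inputs.append(x)   # must stay alive until the backward pass
        x = max(0.0, w * x)
    return x, saved_inputs


def backward(weights, saved_inputs):
    """Gradient of the output with respect to the network input (chain rule)."""
    grad = 1.0
    for w, x_in in zip(reversed(weights), reversed(saved_inputs)):
        # ReLU passes the gradient only where its pre-activation was positive.
        grad = grad * w if w * x_in > 0 else 0.0
    return grad


weights = [2.0, 0.5, 3.0]        # made-up layer weights

y_inf = forward_inference(weights, 1.0)
y_train, saved = forward_training(weights, 1.0)
dy_dx = backward(weights, saved)
```

In the inference path the live state is a single activation regardless of depth, while the training path holds one saved value per layer. On real hardware that translates into much more memory traffic and storage for training.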