Hacker News new | ask | show | jobs
by fizixer 2235 days ago
You still need tensor cores for inference. But they don't do weight updates. Learning/training is all about updating the weights (through backpropagation or whatever).

So another way to put it: its tensor cores do feed-forward calculations, but no backpropagation, and no weight updates.

1 comments

The hardware and platform is capable of training just fine. It's just rarely done because it is slower than training on pretty much any discrete GPU.