Hacker News
by written-beyond 54 days ago
I thought these TPUs were primarily used for inference?
2 comments

TPU8t is for training. But even so, once you’ve trained, you need to run the model too. And these kinds of models already take a huge latency hit, so there’s not much harm in running them away from the trading switches.
As the article states, there are dedicated chips for both training and inference.