| Thanks for sharing and very insightful. Guess the TPUs are the real deal. About 1/2 the cost for similar performance. Would assume Google is able to do that because of the less power required. I am actually more curious to get a paper on the new speech NN Google is using. Suppose to be 16k samples a second through a NN is hard to imagine how they did that and was able to roll it out as you would think the cost would be prohibitive. You are ultimately competing with a much less compute heavy solution. https://cloudplatform.googleblog.com/2018/03/introducing-Clo... Suspect this was only possible because of the TPUs. Can't think of anything else where controlling the entire stack including the silicon would be more important than AI applications. |
You can’t buy a TPU, it’s a cloud only thing. They also show it’s not a huge difference in both perf and time to converge (albeit only one architecture)
I would say kudos to V100 and this benchmark that breaks the TPU hype.