Hacker News new | ask | show | jobs
by Lucasoato 2199 days ago
> Each move during self-play uses about 0.4 seconds of computer thinking time.

> Over 72 hours, 4.9 million matches were played.

One of this claim must be incorrect or misinterpreted, I highly doubt they used so many TPU's as the article claims. That would be not only impractical but also it would raise a lot of other issues like networking, disk speed... etc...

My statement is not against this article, if anyone can confirm they used so many TPUs in parallel feel free to post it

1 comments

72 hours are 259200 seconds.

Playing 4.9 million matches of ~100 plies each at 0.4 seconds per ply is 196000000 seconds.

That's < 1000 TPUs. Sounds big but not too-large-for-google big. But other comments here say that the 0.4 second number is also wrong (and in fact significantly lower).