Hacker News new | ask | show | jobs
by dialup_sounds 662 days ago
https://arxiv.org/abs/2407.21075

AFM-server was trained on 8192 TPUv4 chips

Someone more versed can say if that is huge or not.

1 comments

It is a far larger scale than most high performance clusters offer.