Hacker News new | ask | show | jobs
Scaling Laws for DiLoCo (arxiv.org)
2 points by isof4ult 460 days ago