Hacker News
by shdh 63 days ago
Likely accurate

This tends to happen during the pretraining phase of new models

Happened with 3.x too