Hacker News new | ask | show | jobs
by cadamsdotcom 513 days ago
Synthetic data is doing wonders for models like Phi-4, and at least part of the dataset for DeepSeek-R1 came from their earlier models.

If you read the literature from the Phi-4 team it talks about synthetic data allowing better control over the training process. The upfront investment is greater but pays off over multiple generations of trained models - and doesn’t leave you with SolidGoldMagikarp ;)

https://www.lesswrong.com/posts/aPeJE8bSo6rAFoLqg/solidgoldm...