by someotherperson 56 days ago
The medium and small players are literally just distilling the larger models.

It's not the smaller players spending billions on training data.

1 comment

No, the medium and small players are the Mistrals, DeepSeeks and H Companies of the world, with their own models that use quirky optimisation techniques to compete.