|
|
|
|
|
by AshamedCaptain
8 days ago
|
|
> You can post-train any LLM very easily without access to the original training data. Are you claiming this is e.g. what Alibaba spends their time doing? My point is that the usefulness of this is limited _in comparison to the one provided by having their training data AND mechanisms_. |
|
Not most of the time (pre-training takes a long time), but post-training is where most of the value is, yes.
Famously it is all that OpenAI did between GPT 4o and GPT 5.3 (or 5.2?) - they didn't manage to complete a pre-training run[1], and all their progress was done with post-training (!)
Post training what Cursor spends their time doing, and that has built a model that is competitive with the best coding models out there.
It isn't limited at all.
If you want to complain about something not being open source, complain about the lack of good open source RL environments (Prime Intellect excepted).
[1] https://newsletter.semianalysis.com/p/tpuv7-google-takes-a-s...