|
|
|
|
|
by jononor
19 days ago
|
|
These models will never compete with frontier models and do not need to - it is about hitting a good-enough, not being the best.
Behind the frontier, getting to a certain performance level, is getting easier over time - both sample and compute efficiency is going up. Furthermore one can reuse investments in data (both agreements, infrastructure and datasets), compute (GPUs, servers) and know-how (training scripts, experienced engineers). |
|
I understand and agree building the LLMs yourself comes with more benefits, long-term ones especially, but still it's harder, more expensive and really time consuming work.