by vinni2
5 days ago
> every dollar you're spending on trying to train larger models is a losing prop

You probably don’t know how smaller models are trained, then. Most of them are distilled from larger models or trained on data generated by larger models. If development of larger models stops, there is no magical way for smaller models to keep getting better.