|
|
|
|
|
by alanbernstein
848 days ago
|
|
> they just come from training bigger models on the same data Are you arguing that all AI models are using the same network structure? This is only true in the most narrow sense, looking at models that are strictly improvements over previous generation models. It ignores the entire field of research that works by developing new models with new structures, or combining ideas from multiple previous works. |
|
The exception is when you care about efficiency (in training or inference costs) but at the limit or if you care about "better" then you don't.