|
|
|
|
|
by hodgehog11
318 days ago
|
|
Working in the theory, I can say this is incredibly unlikely. At scale, once appropriately trained, all architectures begin to converge in performance. It's not architectures that matter anymore, it's unlocking new objectives and modalities that open another axis to scale on. |
|