|
|
|
|
|
by stingraycharles
31 days ago
|
|
The economics of local AI just doesn’t make sense. A model like Opus is - supposedly - something like 5T parameters, which is likely something like 3TB of GPU memory. Local models never reach the % utilization that cloud providers have (80%+), and they’re always going to be much better than local models for this reason. |
|
It’s not unreasonable to suppose that in 2 years time an opus 5 quality model will be etched into silicon for high performance local inference. Then you just upgrade your model every 2-3 years by upgrading your hardware.