|
|
|
|
|
by frogperson
5 days ago
|
|
I dont think open weight models are going to be a big disrupter. At even a small scale you need a machine with 8 gpus just to run the larger models that still arent that close to SOTA. A machine like that is half a million per year. You might need a second machine for your dev environment or do fine tuning. Then you will need to hire an AI engineer who know who to tune the model. You'll probably need k8s and Dynamo expert to run infra to do the inference. It all gets very expensive, very quickly. Sure some customers will pay this, but many, many more are just going to pay per token. It's the same model that made the hyper-scalers like AWS into giant cash cows. Only this time, tokens can be sold to both enterprises and grandma. Grandma never bought AWS, which means the market for convenient tokens is huge. |
|
All your general knowledge frontier models are just going to be baked into every OS and browser. Coding is almost to point of being viable for self host.. I think the only real use for high end paid models is gonna be kinda like things now... high end scientific workloads.. professional rendering.. especially 3d and video, financials, etc.
A $4,000 mac can take you pretty far right now.. and we have the upcoming dgx hitting mainstream consumers. Give it another couple years. It's changing fast.