| > Sure, I can run some slow Llama3 models on my home network, but why bother when it is so cheap or free to run it on a cloud service? Obvious answer: because it's not free, and it's not cheap. If you're playing with a UI library, lets say, QT... would you: a) install the community version and play with ($0) b) buy a professional license to play with (3460 €/Year) Which one do you pick? Well, the same goes. It turns out, renting a server large enough to run big (useful, > 8B) models is actually quite expensive. The per-api-call costs of real models (like GPT4) adds up very quickly once you're doing non-trivial work. If you're just messing around with the tech, why would you pay $$$$ just to piss around with it and see what you can do? Why would you not use a free version running on your old PC / mac / whatever you have lying around? > I used to be excited about running models locally That's an easy position to be one once you've already done it and figured out, yes, I really want the pro plan to build my $StartUP App. If you prefer to pay for an online service and you can afford it, absolutely go for it; but isn't this an enabler for a lot of people to play and explore the tech for $0? Isn't having more people who understand this stuff and can make meaningful (non-hype) decisions about when and where to use it good? Isn't it nice that if meta released some 400B llama 4 model, most people can play with it, not just the ones with the $7000 mac studio? ...and keep building the open source ecosystem? Isn't that great? I think it's great. Even if you don't want to play, I do. |
"What if I want to play around with really PERSONAL stuff."
I've been keeping a digital journal about my whole life. I plan to throw that thing into an AI to see what happens, and you can be damn sure that it will be local.