| I’m a bit confused. Your reasoning doesn’t align with the data you shared. The startup costs for just messing around at home are huge: purchasing a server and gpus, paying for electricity, time spent configuring the api. If you want to just mess around, $100 to call the world’s best api is much cheaper than spending $2-7k Mac Studio. Even at production level traffic, the ROI on uptime, devops, utilities, etc would take years to recapture the upfront and on-going costs of self-hosting. Self hosting will have higher latency and lower throughput. |
pacman -S ollama
ollama serve
ollama run llama3
My basic laptop with about 16 GB of RAM can run the model just fine. It's not fast, but it's reasonably usable for messing around with the tech. That's the "startup" cost. Everything else is a matter of pushing scale and performance, and yes that can be expensive, but a novice who doesn't know what they need yet doesn't have to spend tons of money to find out. Almost any PC with a reasonable amount of RAM gets the job done.