by fafqg 1149 days ago
You could say the same about many SaaS projects. Why pay upfront for an expensive GPU, and then for someone to install it, configure it, and build some sort of interface for you to talk to it, when you can just pay OpenAI to do it for less money?

Because GPU servers that can run a typical LLM cost less than 5k, installation included. Really, from a system administrator's perspective, running an LLM seems to be no more complicated than running a NAS.

Emphasis on running, not training.

5k can't even get you 80 GB of VRAM on a GPU. How is that possible?
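
A rough back-of-the-envelope sketch of the running vs. training gap, in case it helps. Everything here is an assumption for illustration: the model sizes (7B, 13B, 70B) are hypothetical examples rather than anything named above, the estimate counts weights only (no activations or KV cache), and the 16 bytes per parameter for training is the commonly quoted figure for mixed-precision Adam (fp16 weights and gradients plus fp32 master weights and two optimizer moments). Whether a given model fits a given budget depends on the precision you can tolerate.

```python
# Rough memory estimate: parameters x bytes per parameter.
# All model sizes and byte counts below are illustrative assumptions.

BYTES_FP16 = 2.0          # half-precision weights
BYTES_INT4 = 0.5          # 4-bit quantized weights
BYTES_TRAIN_ADAM = 16.0   # assumed: mixed-precision Adam (weights + grads + optimizer state)


def memory_gb(params_billion: float, bytes_per_param: float) -> float:
    """Return approximate memory in GB (decimal) for the given parameter count."""
    return params_billion * 1e9 * bytes_per_param / 1e9


if __name__ == "__main__":
    for size in (7, 13, 70):  # hypothetical model sizes, in billions of parameters
        print(
            f"{size}B params: "
            f"run fp16 ~{memory_gb(size, BYTES_FP16):.0f} GB, "
            f"run 4-bit ~{memory_gb(size, BYTES_INT4):.1f} GB, "
            f"train ~{memory_gb(size, BYTES_TRAIN_ADAM):.0f} GB"
        )
```

Under these assumptions, the smaller models need far less than 80 GB just to run, especially when quantized, while training the same model needs several times more.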