Hacker News
by choldstare 59 days ago
Not really - on-prem LLM hosting is extremely labor- and capital-intensive.

But can be, and is, done. I work for a bootstrapped startup that hosts a DeepSeek v3 retrain on our own GPUs. We are highly profitable. We're certainly not the only ones in the space, as I'm personally aware of several other startups hosting their own GLM or DeepSeek models.
Why a retrain? What are you using the model for?