| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by visarga 860 days ago
	ollama swaps models from the local library on the fly, based on the request args, so you can test against a bunch of models quickly

1 comments

eclectic29 860 days ago

Once you've tested to your heart's content, you'll deploy your model in production. So, looks like this is really just a dev use case, not a production use case.

link

silverliver 860 days ago

In production, I'd be more concerned about the possibly of it going off on it's own and autoupdating and causing regressions. FLOSS LLMs are interesting to me because I can precisely control the entire stack.

If Ollama doesn't have a cli flag that disables auto updating and networking altogether, I'm not letting it anywhere near my production environments. Period.

link

eclectic29 860 days ago

If you’re serious about production deployments vLLM is the best open source product out there. (I’m not affiliated with it)

link