| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by erusev 65 days ago
	This is partly why we're building LlamaBarn. It's a lightweight macOS menu bar app that runs llama-server under the hood, with models stored as standard GGUFs in your Hugging Face cache — the same location llama-server uses by default. No separate model store, no lock-in. https://github.com/ggml-org/LlamaBarn