HN Mirror

	by magicalhippo 265 days ago
	> Last I checked Ollama inference is based on llama.cpp

	Yes and no. They've written their own "engine" using GGML libraries directly, but fall back to llama.cpp for models the new engine doesn't yet support.
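
To illustrate the "new engine first, llama.cpp as fallback" arrangement described above, here is a minimal sketch of that kind of dispatch. Everything in it is an assumption made up for illustration: the architecture names, the `NEW_ENGINE_SUPPORTED` set, and the functions `pick_backend`, `run_new_engine`, `run_llamacpp`, and `generate` are hypothetical and are not taken from Ollama's code, which may make this decision quite differently.

```python
# Hypothetical sketch of a "new engine with fallback" dispatch.
# None of these names come from Ollama's codebase.

# Architectures the (hypothetical) new GGML-based engine can handle.
NEW_ENGINE_SUPPORTED = {"arch-a", "arch-b"}


def run_new_engine(arch: str, prompt: str) -> str:
    """Stand-in for running inference on the new engine."""
    return f"[new-engine:{arch}] {prompt}"


def run_llamacpp(arch: str, prompt: str) -> str:
    """Stand-in for running inference through the llama.cpp path."""
    return f"[llama.cpp:{arch}] {prompt}"


def pick_backend(arch: str) -> str:
    """Prefer the new engine; fall back when the architecture is unsupported."""
    return "new-engine" if arch in NEW_ENGINE_SUPPORTED else "llama.cpp"


def generate(arch: str, prompt: str) -> str:
    """Route a request to whichever backend pick_backend selects."""
    if pick_backend(arch) == "new-engine":
        return run_new_engine(arch, prompt)
    return run_llamacpp(arch, prompt)


if __name__ == "__main__":
    print(generate("arch-a", "hello"))  # handled by the new engine
    print(generate("arch-z", "hello"))  # falls back to llama.cpp
```

The point of the sketch is only that the choice is made per model, so both answers to "is it llama.cpp underneath?" can be true depending on which model is loaded.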