HN Mirror

	by Patrick_Devine 357 days ago
	Ollama only uses llama.cpp for running legacy models. gpt-oss runs entirely in the Ollama engine. You don't need to use Turbo mode; it's just there for people whose GPUs aren't capable enough.