| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by shay_ker 46 days ago
	How does it compare to popular local inference engines, e.g. ollama, lm studio, or handrolled llama.cpp? I saw a brief benchmark in the readme but wasn't sure if there was more.