| HN Mirror

	by 127 69 days ago
	I get 150 t/s peak and 120 t/s average with Qwen3.6 27B Q4 on a 4090 under Linux, now that MTP has landed in llama.cpp.