| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by guywhocodes 974 days ago
	Seems to be running on llama.cpp, so it's going to be a question of performance. I don't have any M-cpu but on my 13th gen i5 I can run mistral at about 6.5 tokens per second. Which seems comparable to what this is.