| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by fy20 1045 days ago
	You can probably run it locally with llama.cpp using CPU only, but it will be slow. I have a couple year old laptop with a RTX 3060 and it runs pretty well split across the CPU and GPU.