| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by millimeterman 1154 days ago
	I suspect the community will start creating lower precision/quantized versions of the model very quickly. LLaMa 30b quantized to 4 bits is runnable on a 3090/4090.