| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bt1a 518 days ago
	How excellent for a quantized 27GB model (the Q6_K_L GGUF quantization type uses 8 bits per weight in the embedding and output layers since they're sensitize to quantization)