| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by brucethemoose2 879 days ago
	Its not really 2 bits. Modern quantization schemes are almost like lossy compression algorithms, and llms in particular are very "sparse" and amenable to compression.