| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kurthr 250 days ago
	Are there quantized (eg 4bit) models available yet? I assume the training was done in BF16, but it seems like most inference models are distributed in BF8 until they're quantized. edit ahh I see it on huggingface: https://huggingface.co/mlx-community/Olmo-3-1125-32B-4bit