by kurthr 202 days ago
Are there quantized (e.g. 4-bit) models available yet? I assume the training was done in BF16, but it seems like most inference models are distributed in BF8 until they're quantized.
Edit: ahh, I see it on Hugging Face:
https://huggingface.co/mlx-community/Olmo-3-1125-32B-4bit