Y
Hacker News
new
|
ask
|
show
|
jobs
by
reichardt
407 days ago
With around 4.6 GiB model size the new Qwen3-8B quantized to 4-bit should fit comfortably in 16 GiB of memory:
https://huggingface.co/mlx-community/Qwen3-8B-4bit