Y
Hacker News
new
|
ask
|
show
|
jobs
by
jszymborski
341 days ago
If Qwen 0.6B is suitable, then it could fit in 576MB of VRAM[0].
https://huggingface.co/unsloth/Qwen3-0.6B-unsloth-bnb-4bit
1 comments
numpad0
340 days ago
or on a single Axera AX630C module:
https://www.youtube.com/watch?v=cMF6OfktIGg&t=25s
link