Hacker News new | ask | show | jobs
by rmbyrro 621 days ago
Do you have a ballpark idea of how much RAM would be necessary to run llama 3.1 8b and 70b on 8-quant?
1 comments

Roughly, at Q8 the model sizes translate to GB, so ~3 and ~70GB.
You mean 8, not 3?
Yes, apologies, can't edit now