Hacker News new | ask | show | jobs
by millimeterman 1154 days ago
I suspect the community will start creating lower precision/quantized versions of the model very quickly. LLaMa 30b quantized to 4 bits is runnable on a 3090/4090.