Hacker News new | ask | show | jobs
by zbendefy 501 days ago
No, the full R1 model is ~650GB. There are quantized version that quantize it down to ~150GB.

What you can run locally are the distilled models, that is actually LLama and Qwen weights further trained on R1's output