|
|
|
|
|
by zbendefy
498 days ago
|
|
Note: you are probably running a distilled version of R1, which is actually LLama or Qwen further trained on the input/output of R1. The full R1 is huge (~700GB), altough there are still quantized versions, the smallest one is around 150gb (1.58bit) |
|