|
|
|
|
|
by hnfong
470 days ago
|
|
The DeepSeek R1 distilled onto Llama and Qwen base models are also unfortunately called “DeepSeek” by some. Are you sure you’re looking at the right thing? The OG DeepSeek models are hundreds of GB quantized, nobody is using RTX GPUs to run them anyway… |
|