by ipsum2 511 days ago
The title is wrong: only the distilled models from Llama and Qwen are on Ollama, not the actual official MoE R1 model from DeepSeek-V3.

Sorry about that. We are currently uploading the 671B MoE R1 model as well. We needed some extra time to validate it on Ollama.
The naming of the models is quite confusing too...
Did you mean the tags or the specific names of the distilled models?
The 671B model is now available:

4-bit quantized: ollama run deepseek-r1:671b

(400GB+ of VRAM/unified memory required to run this)

https://ollama.com/library/deepseek-r1/tags

The 8-bit quantization is still being uploaded.
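
For a rough sense of where the 400GB+ figure comes from, here is a back-of-the-envelope sketch in Python. It assumes every one of the 671B parameters is stored at exactly the nominal bit width and counts only the weights; the function name weight_memory_gb is made up for illustration and is not part of Ollama.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate size of the weights alone, in decimal GB (1 GB = 1e9 bytes).

    Assumption: every parameter uses exactly bits_per_weight bits.
    KV cache, activations, and runtime buffers are not counted.
    """
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


if __name__ == "__main__":
    # 671B is the parameter count quoted in the thread.
    for bits in (4, 8):
        size = weight_memory_gb(671, bits)
        print(f"{bits}-bit: ~{size:.1f} GB for weights alone")
```

This gives roughly 335.5 GB at 4 bits and 671 GB at 8 bits as a lower bound. Real quantization formats typically store scaling data and may keep some tensors at higher precision, and the context cache comes on top, so actual requirements land above these numbers, which is consistent with the 400GB+ note for the 4-bit build.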