by yetanotherjosh 507 days ago
ollama is stating there's a difference: https://ollama.com/library/deepseek-r1

"including six dense models distilled from DeepSeek-R1 based on Llama and Qwen."

people just don't read? not sure there's reason to criticize ollama here.

1 comment

i’ve seen so many people make this mistake. huggingface clearly differentiates the models, but from the cli that distinction isn’t visible