Hacker News new | ask | show | jobs
by semicolon_storm 512 days ago
Are you referring to the distilled models?
1 comments

yes, they are not r1
Can you explain what you mean by this?
For example, the model named "deepseek-r1:8b" by ollama is not a deepseek r1 model. It is actually a fine tune of Meta's Llama 8b, fine tuned on data generated by deepseek r1.