Y
Hacker News
new
|
ask
|
show
|
jobs
by
semicolon_storm
512 days ago
Are you referring to the distilled models?
1 comments
whimsicalism
512 days ago
yes, they are not r1
link
BeefySwain
512 days ago
Can you explain what you mean by this?
link
baobabKoodaa
511 days ago
For example, the model named "deepseek-r1:8b" by ollama is not a deepseek r1 model. It is actually a fine tune of Meta's Llama 8b, fine tuned on data generated by deepseek r1.
link