Hacker News new | ask | show | jobs
by baobabKoodaa 508 days ago
For example, the model named "deepseek-r1:8b" by ollama is not a deepseek r1 model. It is actually a fine tune of Meta's Llama 8b, fine tuned on data generated by deepseek r1.