Hacker News new | ask | show | jobs
by rocho 493 days ago
That's not DeepSeek, it's a Qwen or Llama model distilled from DeepSeek. Not the same thing at all.