Hacker News new | ask | show | jobs
by zaphar 35 days ago
Many of the chinese developers are just drafting off of Anthropic, OpenAI, and Gemini though. Distillation is leveraging those models to achieve their leaps in training. It's not obvious to me that Anthropic and do the same. Someone has to build the advanced model to draft off of.
1 comments

Of course Anthropic is doing the same. Just ask Claude in Chinese to introduce himself and tell you what he does and how it can help you. There's a good chance that it will tell you its name is DeepSeek ;)
A hallucination is not an indicator of distillation...