| HN Mirror


by adrian_b 2 hours ago

I doubt that it has ever been possible to obtain enough output tokens from OpenAI or Anthropic to be useful for training other LLMs.

In any case, even if that was possible in the beginning, it stopped being possible long ago, because suspicious accounts get banned, and the cost would be prohibitive even if they were not.

On the other hand, anyone can train new LLMs using the open-weights Chinese LLMs, or the far fewer open-weights LLMs of other origins, like the NVIDIA LLMs.

So in reality it is much more plausible for a US company to use Chinese LLMs for training, than vice versa.

2 comments

ivanovm 8 minutes ago

it is certainly possible and being done all over the place. there's a black market that chinese labs use to buy frontier american llm trajectories by the millions through US intermediaries. they're not even particularly shy about it, i have been offered $0.7 per opus 4.8 call

there's also a market for chinese labs sending checkpoints to US companies to be trained on US compute and sent back

i'm surprised that so many people take chinese tech reports about how they train their models at face value tbh

NooneAtAll3 1 hour ago

> So in reality it is much more plausible for a US company to use Chinese LLMs for training, than vice versa.

that's... exactly the point?

make it easy to steal tech from opponent, not from you