by vagrantJin 51 days ago
You definitely have a bone to pick. Chinese researchers have consistently given the world the cheapest high-quality research around LLMs. They don't pretend, they do the work and release the goodies. Mostly so cheaply that everyone in the world has a chance to use close-to-frontier models. Why would you respond with "Anger"?

Let us know what your real complaint is, and let's not feign indignation at open models and research.

1 comments

You're making completely unfounded assumptions about me. I use Chinese models myself.
Anthropic and OpenAI took your data, trained their models, and told you "we are not going to tell you anything about how we trained our models, we are not giving you the weights of our models, you will have to pay us to access the model trained from your data".

they took your rights and your data.

Chinese labs took your data, trained their models, and told you "this paper details how our models are trained using your data, here are the final weights of our model trained from your data, feel free to use it for what you want, it is your model trained on your data".

they converted your data, and everything is still in your hands, under your control.

can't you see the difference?

Your specific question can actually be translated as -

1. why don't people stop Chinese labs so the US monopoly can be maintained?

2. why don't people stop Chinese labs from providing free models to those who would otherwise never be able to afford the $200 USD/month Anthropic and OpenAI subscriptions?

3. why don't people complain about Chinese labs publishing those trillion-dollar secret ideas on model training?

well, because most people are not dickheads, I guess?

Hold up. Look, this is all shades of grey, but saying Chinese labs all release open-weights stuff is a kinda crazy thing to say.

Right now they are doing that because they are still trying to catch up to Anthropic, Google, and OpenAI.

The moment they have the special sauce, they will shut it down and you won't be able to run their stuff anywhere except through them. Why do I say that? We already have the evidence in the diffusion model arena. All the Chinese labs were pumping out open-weights models for image and video, and the moment they got to SOTA, they stopped doing it. Less and less is being released.

Chinese companies aren't doing open-weights models out of the goodness of their hearts, they are doing it because it helps their entire industry catch up. Don't get it twisted, this is very much a US vs China battle here. China wants to win and I am not sure how they won't. DeepSeek is the first major large model trained on Huawei chips. It won't be the last, and I am betting that China will make up for the lesser performance of those chips with more manufacturing and power generation.

I am very bullish on China winning the AI war here. But I am also not naive enough to think that the Chinese companies are doing open weights out of wanting to make the world a better place or out of the goodness of their hearts. It undercuts the American AI companies.

Now we get to the nub. American anti-Chinese rhetoric. Very good.
I made no such claims. Maybe you have something to share about why we need to have a negative view of free and open models based on publicly available frontier research.