Hacker News new | ask | show | jobs
by xgstation 517 days ago
It is really fun and sarcastic to watch all this happening. U.S. Gov. tried to block China from accessing GPU resources very hardly to stop their AI development, but actually helped China to take a leap on developing efficient and more cost-effective LLM model with constraint GPU access.
3 comments

And then "China" (which is actually a bunch of super generous folks at DeepSeek) decides to release it all back to the US under a permissive MIT license.

They could've just exposed an API and kept the model to themselves but they didn't!

They could've not published their research paper, but they did, again and again - and each time they publish they discuss not just the techniques that DO work, but those that don't - saving researchers everywhere from loads of dead ends.

That is pure awesome. Thank you DeepSeek engineers for your gift to humanity.

Do they have models that try to downplay what happened on Tiananmen Square? That would be a sneaky way to shape our future in some way (and no whataboutism, we do it too).
No human is in danger of forgetting Tiananmen Square unless they didn’t know about it in the first place. Details are strewn across the Internet and in book-libraries all over the world. New generations of students and interested kids can easily learn about them.

Additionally it has been shown that making models forget things lobotomizes them, so no SOTA model can ever do that and be SOTA. They might be post-trained into pretending not to know, but the technology fundamentally cannot resist jailbreaking.

Do you have examples of knowledge that has actually become at risk as a result of this one AI model being added to the pile??

I couldn't agree more. China has: * weaker GPUs * a smaller model * started with nothing

And now they're building better, faster, and cheaper models at a fraction of the cost. It's hilarious and exciting.

Speaking of sarcasm, and thinking about your argument, should we start selling them cutting edge GPUs to slow down their research?
I doubt you ur GPU sanctions have had much of an influence one way or the other. They can get their resources from third countries even if they can’t get them directly from the USA. I wonder if the USA will eventually try to lock down higher end NVIDIA GPUs and prevent export all together.