by cadamsdotcom 517 days ago
And then "China" (which is actually a bunch of super generous folks at DeepSeek) decides to release it all back to the US under a permissive MIT license.

They could've just exposed an API and kept the model to themselves but they didn't!

They could've not published their research papers, but they did, again and again - and each time they publish they discuss not just the techniques that DO work, but also those that don't - saving researchers everywhere from loads of dead ends.

That is pure awesome. Thank you DeepSeek engineers for your gift to humanity.

1 comment

Do they have models that try to downplay what happened at Tiananmen Square? That would be a sneaky way to shape our future in some way (and no whataboutism, we do it too).
No human is in danger of forgetting Tiananmen Square unless they didn’t know about it in the first place. Details are strewn across the Internet and in libraries all over the world. New generations of students and interested kids can easily learn about them.

Additionally, it has been shown that making models forget things lobotomizes them, so no SOTA model can ever do that and stay SOTA. They might be post-trained into pretending not to know, but the technology fundamentally cannot resist jailbreaking.

Do you have examples of knowledge that has actually become at risk as a result of this one AI model being added to the pile?