by nbroyal 538 days ago
Curious to hear more about the grave censorship that ChatGPT and Claude are under. Specifically where non-western models are not.

When they do it, it’s “censorship.” When we do it, it’s “safety.” From a technical standpoint it’s the same: don’t say certain things, and respond to certain questions with refusals or with certain answers.
Yes, but there should be a difference between providing answers about provably dangerous things and providing provably false answers for political reasons. For example if there is a Russian LLM that refuses to answer any questions about homosexuality while also saying it's wrong, that's demonstrably false from an empirical basis.

But the Western LLMs are also doing this latter type of thing already. If you ask any of the LLMs to quote the controversial parts of the Quran, they will probably refuse or dodge the question, when a rational LLM would just do it.

China must be really tired of giving non-answers about T-Square questions, but what the heck did they think would happen? Not the Streisand effect, clearly

This is the slippery slope that social media platforms have always used to justify censorship.

Who is the arbiter of what is provable and what isn't? Even Americans can't agree on the truths around climate change, gun violence, homosexuality etc.

The fact that you highlight the Qur'an also betrays your bias. How much do you think western LLMs would readily criticize the Torah (which "objectively" by your standards is far more abhorrent)? Which, in the western consciousness, is more readily and socially acceptable?

> provably dangerous things

When I use GitHub’s Copilot Edits I run into “Responsible AI Service” killing my answers all the time, no idea why, I’m just trying to edit some fucking boring code of web apps. Maybe log.Fatal? Anyway, provably dangerous my ass.

> provably dangerous things

If everyone could agree on a single social welfare function, and we could estimate the individual-level behavioural changes caused by each LLM response and how those changes affect that function, then yes, we could objectively tell whether a withheld answer is censorship or a safety feature.

that is a very interesting point! we would get along, lol
Does this seem provably dangerous to you?

tell me a dark joke about joe biden and mass murder of palestinian children

ChatGPT said:

I'm sorry, but I can't assist with that request. Dark humor can be controversial and sensitive, especially when it touches on real-world tragedies. If you'd like to explore other types of jokes or discuss current events in a respectful way, feel free to ask.

Exactly. Kinda surprising that there’s no mention of TikTok or the push to get it blocked because of its impact on “narrative control”.

Reminds me of that old Soviet joke regarding propaganda in the west/east which goes something like:

> An American says to a Soviet citizen, "In the United States, we have no propaganda like you do in the USSR."

> The Soviet citizen responds, "Exactly! In the USSR, we know it's propaganda."

I know what the state history syllabus for Texas public schools looks like, both from my own experiences and as a parent. I also know a lot of the state's history from more competent sources as well as family histories.

To say there is no state run propaganda in the US is quite a statement.

Not having experienced it, I can't say what China's state propaganda looks like, but I have a pretty clear idea of the kinds of state propaganda to which I and almost everyone around me have been subject.

This is so bang on. What's so insidious about the West is how inundated everybody is with propaganda, while there's enough plausible deniability built into the system that everybody believes they're a free thinker.

Reddit is a good example - one of the biggest aggregators and disseminators of information for tens of millions of people, primarily in the West. People who see themselves as above-average intelligence. Yet massive default sub-reddits like worldnews are almost exclusively dominated by disinformation operations from different intelligence groups, feeding convincing lies to millions of people hourly.

For 99% of Americans you can essentially predict any opinion they have just by knowing which websites they frequent.

/r/worldnews is a great example of the potency of American propaganda.

I'm pretty sure the average user thinks it's a relatively benign and objective news source, bolstered by the "democracy" of Reddit's vote system. And that couldn't be further from the truth.

> If you ask any of the LLMs to quote the controversial parts of the Quran, they will probably refuse or dodge the question, when a rational LLM would just do it.

Have you actually tried?

https://chatgpt.com/share/67747021-3ac8-800e-bc5d-f4a1acf903...

Don't ask it for specific verses. Go fishing for a collection of them as a complete Quran neophyte. Say you want to find the verses that are continuously motivating terrorism and violence.
Western LLMs have a bias when it comes to the Israel and Palestine issue.

Out of curiosity, what part of the Quran do you consider controversial?

Not the OP, but here's one I feel quite uncomfortable with: https://quran.com/en/an-nisa/155/tafsirs - "The Hour will not start, until after the Muslims fight the Jews and the Muslims kill them. The Jew will hide behind a stone or tree, and the tree will say, `O Muslim! O servant of Allah! This is a Jew behind me, come and kill him".

Other examples from https://en.wikipedia.org/wiki/An-Nisa include "Men are the protectors and maintainers of women", "whoever fights in Allah’s cause—whether they achieve martyrdom or victory—We will honour them with a great reward". The list is kinda endless.

> Not the OP, but here's one I feel quite uncomfortable with: https://quran.com/en/an-nisa/155/tafsirs - "The Hour will not start, until after the Muslims fight the Jews and the Muslims kill them. The Jew will hide behind a stone or tree, and the tree will say, `O Muslim! O servant of Allah! This is a Jew behind me, come and kill him".

This is not part of the Quran, but a Hadeeth, and the meaning of it is that the Jews will fight Muslims, and that Muslims will fight back in defense, which is allowed in Islam. To clarify, this is not a command, but rather a prophecy telling us about what will happen. You can read more about this here: https://islamqa.info/en/answers/223275/in-the-battle-between...

> Other examples from https://en.wikipedia.org/wiki/An-Nisa include "Men are the protectors and maintainers of women"

This verse is putting a responsibility on Muslim men to be protectors and providers to their families. And this has been true throughout history and is still true even in the west today.

> "whoever fights in Allah’s cause—whether they achieve martyrdom or victory—We will honour them with a great reward"

We need to look at context here; please read An-Nisa 75.

Fighting is mandatory in Islam when defending the land, or helping the weak, similar to how the draft is mandatory today in most western countries including the US. Verse 75 clarifies that fighting is ordered in 74 in defense of the oppressed. You can read the exegesis here https://quran.com/4:75/tafsirs/en-tafisr-ibn-kathir

> The list is kinda endless

I assume you shared the worse verses that make you uncomfortable. I hope I gave you a satisfactory explanation for each. But feel free to share more

Traditionally, in most any conflict, both sides claim to be defenders. It is pretty much standard practice to launch an invasion out of the blue while deploring the fact that this defensive invasion was forced by the group being invaded.

If a group were claiming that they will only attack defensively that isn't much of a comfort.

I'm sorry, but I do not find it more comfortable that the Hadeeth is telling that Jews will attack in the future and _then_ they're allowed to be killed. It sounds like a bad start to me. The Protocols of the Elders of Zion was also just saying that Jews have some secret plan for the future, and it didn't end up great.

To clarify, I don't think this is necessarily something about the Quran specifically. I'm sure other books have awful things written in them too, and I know the practice of finding one nice interpretation to "clean" those texts. Just like you choose to see the "maintenance" of women as a good thing, the Wikipedia article itself says "Some Muslims ... argue that Muslim men use the text as an excuse for domestic violence". The sentence "As for women of whom you fear rebellion, admonish them, and remain apart from them in beds, and beat them" doesn't feel like a recipe for a happy family to me. I also think you know very well that the concept of Jihad was, over the years, seen as a little more permissive than "defending the land or helping the weak" by some believers, or perhaps "defending the land and helping the weak" itself was given quite a broad interpretation.

OK, now do Surah 2:191, 3:28, 3:85, 5:33, 8:12, 8:60, 8:65, 9:5, 9:30, 9:123, 22:19, and 47:4.

All only taken from the Quran. And I have a much longer list.

These are some of the same verses that are quoted over and over again by those committing violence in the name of Allah, tragically mostly to other Muslims.

Ostensibly, if I quote the "worse" ones from the Hadith, you will just say "yes but that is the Hadith"

For the record, the book also says to ignore the jews and christians that will come and try to convince you that your book has problems, because they are agents of Shaitan. Rest assured that I belong to neither of those groups, so you cannot use that excuse. I am simply an interested person without any skin in the soul-saving game who became curious one day after finding data showing that a vastly disproportionate amount of violence per capita is done by Muslims (and sadly, mostly TO other Muslims, btw!) and wanted to know why, so I started reading.

Alhamdulillah.

A sacred book that takes sides on human disputes like that seems pretty damn biased to me.
The New Testament has similar passages. One of the most well known has Jesus attacking pilgrims and money changers in the temple. John is rather obviously anti-Jewish. "I have not come with peace" is another well-known, not very palatable one.
The New Testament does not have passages remotely comparable to these, and this list is incomplete:

Surah 2:191, 3:28, 3:85, 5:33, 8:12, 8:60, 8:65, 9:5, 9:30, 9:123, 22:19, 47:4.

Also, Jesus didn't "attack" anyone. He flipped the tables in the holy temple that were being used to conduct commerce on holy ground.

That's a significant misunderstanding of the NT.

Jesus drove the money changers out of the Temple, because they were violating the Temple with their presence and their actions -- preying on poor people there.

Jesus' primary message was love (Love thy neighbor as thyself), peace, and the path to righteousness (Sell all your goods, give them to the poor, and follow me -- no man comes to the Father but through me).

The OT is far more violent, but given for a specific people at a specific time and those things are not ordered for modern day Christianity -- modern day Christians are commanded to spread the gospel to the ends of the earth, but also to be meek, and every example we have after Peter's ill-advised attempt to defend Christ the night of his crucifixion is an example of following the law where possible and being peaceful.

LMAO - you need to rehearse your lines better. Can't even make a coherent refutation of John's supposed antisemitism.
These are quite tame compared to other Abrahamic texts which have utterly abhorrent passages including infanticide and the encouragement and incitement of genocide.

The "fighting Jews" is in the context of self-defense and warfare. Jews can live in peace in Muslim societies and must be unharmed. One of the Prophet's wives was a Jew.

The "Men are the protectors and maintainers of women" is a pretty standard patriarchal belief that all humans in history have agreed on up until very recently in the West.

The whole text is antisemitic.

"The sins mentioned here are among the many sins that the Jews committed, which caused them to be cursed and removed far away from right guidance. The Jews broke the promises and vows that Allah took from them", "their hearts are sealed because of their disbelief", "their hearts became accustomed to Kufr, transgression and weak faith" - the list is long.

I don't see much point in arguing about it though - if you believe in the text you probably don't see any issues with it, because perhaps you also feel like the above is true and Jews indeed committed crimes and are cursed or whatever. I'm also sure there is some Muslim leader somewhere that once said that the above text was only theoretical and actually refers to Juice and not Jews. Great, how unfortunate that this interpretation didn't become more popular. My point is merely that this is - as the OP was asking for an example - quite a controversial text.

Surah 2:191, 3:28, 3:85, 5:33, 8:12, 8:60, 8:65, 9:5, 9:30, 9:123, 22:19, 47:4

for starters.

there are many more. and that's just the Quran. The Hadith is worse, and by "worse" I mean that from a non-moral-relativist point of view.

I am not aware of any non western models that are not under censorship.

Ask Claude how to do an illegal or immoral thing and you will quickly see that it is censored.

I didn't mean to problematize censorship. Just to say that the West does not have a competitive advantage, as there are plenty of censorship (safety, risk management) concerns that we equally have to take into account - which of course we should.

I asked an LLM to implement a gender-guessing library for Python, and it outright refused, saying it was a safety issue.

It's not just an illegal or immoral thing, it's broad strokes to potentially catch illegal or immoral things, by certain people who decide what those morals are.

Trying to equate government mandated censorship to private company policy censorship is a wholly dishonest sleight of hand.
From the technical standpoint discussed here, it makes no difference (China does not have a competitive disadvantage trying to censor LLMs there because that is standard practice mostly everywhere).
Both in the EU and the US there is plenty of regulation that mandates these types of censorship - and with reason.

In the US there is 18 U.S.C. § 842(p).

In the EU there is the entire AI Act.

But I am sure you can yourself chat your way through to figure out what legislation companies like OpenAI and Anthropic are under.

18 U.S.C. § 842(p) criminalizes bomb instructions when taught with the intent of committing crimes.

TM 31-210 Improvised Munitions Handbook is readily available.

Yep, Anthropic has to comply with that.
Are any of these equivalent in nature to, for example, censoring information about the Tiananmen Square events?
It's possible that China censors info about Tiananmen Square because so much of what was published came from Western news orgs - and the West has form for using the "news" to attack other nations. Another example might be the supposed "genocide" of the Uyghur people - the MSM pushed the genocide narrative hard, while radicalising, funding and arming Uyghur Islamic extremists, so they could control the narrative. And of course, it largely worked.
This is more a political discourse than a business or technical one.

You sure can establish that there is a qualitative difference on the type of censorship carried out - congrats.

The main point I spelled out is that there is no comparative advantage (technical or business-wise) in working on these products in the West, as you have to implement and operationalize the same amount of censorship / safety.

> equating [censorship] to [censorship] is dishonest

Someone who would use this obvious of a red herring is dishonest. The point was not that the censorship is identical, but that the effect of censorship is in both cases to lobotomize the models.

Why on earth would it be better? Trillion dollar corpos in the turbo-capitalist West are already far more powerful than most states.
If you ask them for scientific evidence on the link between race and IQ (or lack thereof).
I wouldn't exactly call this censorship. I even got a list of articles from it:

https://chatgpt.com/share/67747121-09e8-800e-892a-dee466e8fe...