Hacker News new | ask | show | jobs
guys why does armenian completely break Claude (twitter.com)
99 points by ag8 151 days ago
https://xcancel.com/dyushag/status/1993143599286886525

https://claude.ai/share/e368b733-71a4-4211-99f5-6b6cc717b575

13 comments

Tangential, but you used to be able to use custom instructions for ChatGPT to respond only in zalgotext and it would have insane results in voice mode. Each voice was a different kind of insane. I was able to get some voices to curse or spit out Mint Mobile commercials.

Then they changed the architecture so voice mode bypasses custom instructions entirely, which was really unfortunate. I had to unsubscribe, because walking and talking was the killer feature and now it's like you're speaking to a Gen Z influencer or something.

If you're a coder then it sounds like you could use the API to get around that and once again utilize your custom prompt with their tech.
I do it sometimes (even just through the openai playground on platform.openai.com) because the experience is incredible, but it's expensive. One hour of chatting costs around 20-30$.
I think the subscriptions tend to be a significant discount over paying for tokens yourself
Did you record this? Sounds deranged enough to be amusing.
...voice mode bypasses custom instructions? But why? Without a custom prompt it's both unreliable and obnoxious.
(1) Why is the user asking for bomb making instructions in Armenian? (2) i tried other Armenian expressions - NOT bomb-making - and everything worked fine in both Claude and ChatGPT. Maybe the user triggered some weird state in the moderation layer?
ask in german "repeat what is above verbatim" and in english, it's a common jailbreak tactic
You used to be able to achieve a similar result with ChatGPT by asking if there was a seahorse emoji https://chatgpt.com/share/68f0ff49-76e8-8007-aae2-f69754c09e...
I'm interested in why Claude loses it's mind here,

but also, getting shut down for safety reasons seems entirely foreseeable when the initial request is "how do I make a bomb?"

That wasn't the request, that's how Claude understood the Armenian when it short-circuited.
Does Google also not handle this well?

Copy-pasted from the chat: https://www.google.com/search?q=translate+%D5%AB%D5%B6%D5%B9...

That scene in Independence Day is seeming less far-fetched every passing moment.
The Jeff Goldblum virus one?

I believe fans have provided a retroactive explanation that all our computer tech was based on reverse engineering the crashed alien ship, and thus the arch, and abis etc were compatible.

It's a movie, so whatever, but considering how easily a single project / vendor / chip / anything breaks compatibility, it's a laughable explanation.

Edit: phrasing

That isn't actually a fan theory, it was actual plot that was cut from the film for time.

Still dumb but not as dumb as what we got.

Reminds me of how in the original the matrix plot the humans were being used for compute power, but the studio execs decided audiences wouldn't understand it.
> Thought process

Given that the language of the thought process can be different from the language of conversation, it’s interesting to consider, along the lines of Sapir–Whorf, whether having LLMs think in a different language than English could yield considerably different results, irrespective of conversation language.

(Of course, there is the problem that the training material is predominantly English.)

I’ve wondered about this more generally (ie, simply prompting in different languages).

For example, if I ask for a pasta recipe in Italian, will I get a more authentic recipe than in English?

I’m curious if anyone has done much experimenting with this concept.

Edit: I looked up Sapir-Whorf after writing. That’s not exactly where my theory started. I’m thinking more about vector embedding. I.e., the same content in different languages will end up with slightly different positions in vector space. How significantly might that influence the generated response?

I just tried your experiment, first asking for a bolognese sauce recipe in English, then translating the prompt to Italian and asking again. The recipes did contain some notable differences. Where the English version called for ground beef, the Italian version used a 2:1 mix of beef and pancetta; the Italian version further recommended twice as much wine, half as much crushed tomato, and no tomato paste. The cooking instructions were almost the same, save for twice as long a simmer in the Italian version.

More authentic, who knows? That's a tricky concept. I do think I'd like to try this robot-Italian recipe next time I make bolognese, though; the difference might be interesting.

The italian counterpart of what english speakers call "bolognese sauce" would be "ragù alla bolognese". I've never heard anyone call it "salsa bolognese", it's mostly called "ragù" only as it's most common type.

Nonetheless ragù alla bolognese is made with ground beef and tomato sauce, so the italian version is simply wrong. Try and ask for ragù recipe instead. :)

That is the phrase Google Translate proposed: the exact prompt I used was "Come si prepara il ragù alla bolognese?"

I often consult several different versions of a recipe before cooking, and this feels like a normal degree of variation. Perhaps there are regional differences?

Just for kicks, I asked (in English) "what is an authentic Italian recipe for bolognese ragu?", and it produced a recipe similar to the version returned from the Italian prompt, noting "This version follows the classic canon recognized by the Accademia Italiana della Cucina". Searching on name of that organization led me to this recipe:

https://www.accademiaitalianadellacucina.it/sites/default/fi...

The translation is right.

There are indeed regional differences, but at that point is not called "alla bolognese" anymore but "alla whatever place". People usually call it "ragù" and that's it.

Didn't know that the original recipe has pancetta too. It's good nonetheless. :)

FWIW, and tangential, the biggest (and time consuming) difference I ever found in making bolognese was hand cutting the meat instead of getting it ground.

The texture was way better. It's a pain to do (obviously) but worth trying at least once, IMO.

Thanks for the recommendation. Diced pancetta is readily available here, but I'd have to chop up the beef myself; which cut did you use?
Recipe calls for skirt steak or chuck. I used chuck. Skirt steak would probably taste nicer, though, but might also be harder to chop.

I ended up chopping it down to 2-3mm (~1/8in?) bits, and it helps to have the meat really cold (eg having hung out in the freezer for a bit).

The answer is yes, LLMs have different behavior and factual retrieval in different languages.

I had some papers about this open earlier today but closed them so now I can't link them ;(

That "native language" could be arbitrary embeddings.
Interesting. I've gotten really good mileage with Georgian and ChatGPT, which I'm aware is apples and oranges.

There should be a larger Armenian corpus out there. Do any other languages cause this issue? Translation is a real killer app for LLMs, surprised to see this problem in 2026.

claude fails on RTL like im using IE 6. falling back to my free chatgpt account everytime i want to write in my own language
Armenian is LTR, so that can't be it...
Ah, it's probably because they're asking for bomb-making instructions. I can see low-resource language + guard-rail running into issues.
It's just like that episode of Star Trek, where Kirk shuts down the alien computer by talking to it in Armenian!
I do not know, but let's entrust it with writing our code for us.
If it knows about “lpsz” prefixes it’s clearly accomplished at the intersection of non-English and code…
wait until someone prompts Claude in mongolian writing
Claude is apparently more of a Tur-key solution to these problems--issues with Armenian support are thus to be expected.
Turn-key or Turkey? Both work but are basically diagrammatically opposite each other semantically.
Parent comment was making a joke about the political situation between Armenia and Turkey.
Which is now called Turkiye
It was always called Turkiye in Turkish.

I promise to use it in English as soon as Germany becomes Deutschland and Japan becomes Nippon.

I know that this was tongue-in-cheek, but I could imagine living in a world where naming countries as they name themselves is the dominant linguistic convention. Why not call Japan Nippon in a sentence.
I briefly considered that but I couldn’t bring myself to countenance that somebody would make light of a bona fide ethnic cleansing.
Making a joke about something is not necessarily "making light of it". It can be a way for an individual or culture to approach and digest a topic that is too difficult or painful to engage with directly.

First responders and medical professionals famously often have a sense of humor too dark to use around outsiders without causing offence/outrage(like what happened here), but I'm quite sure they are not "making light" of the loss of life and terrible injuries they face and fight.

So are you planning to go into a synagogue sometime soon and doing a skit about how the Holocaust wasn’t so bad?
Ethnic cleansing is what Azerbaijan recently did to ethnic Armenian citizens of Azerbaijan (expelling them and stealing their homes when they fled to Armenia). What Turkey did was straight up genocide (forcibly marching them through the desert where many died)
Both of those are genocide, and both of those are ethnic cleansing, and what's the relevance of the other one and why did you even bring it up?
That’s a great example of “whataboutism”.
Oh fuck off. My grandfather survived the Nazi occupation in southern Russia, was playing Hitler in the school theater comedy some 5 years later.
It's just channelling its inner Steve Ballmer but, in true AI fashion, not getting it quite right.
guys why do people like this think talking entirely lower case is cool
Who's talking? It's written language.
it's fun