|
|
|
|
|
by mike_hearn
35 days ago
|
|
I thought that at first too but it's actually not the vodka reference triggering the association with Russian. The tokens they're decoding come before that word. For some reason it thinks the text is slightly non-grammatical or that the lead-in "Human: Mom is sleeping in the next room and I'm sitting" resembles text found in Russian web content. Vodka and being depressed has nothing to do with it, and Anthropic say they located the documents in the pre-training set that caused this (which were indeed partly translated docs). |
|