| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by tdba 1114 days ago
	In summary, humans win the Turing test ~2/3 of the time against current SOTA LLMs. One of the more interesting tactics used was to target a weakness of the LLMs themselves: > ... participants posed questions that required an awareness of the letters within words. For example, they might have asked their chat partner to spell a word backwards, to identify the third letter in a given word, to provide the word that begins with a specific letter, or to respond to a message like "?siht daer uoy naC", which can be incomprehensible for an AI model, but a human can easily understand...

3 comments

tornato7 1114 days ago

This is a pretty astute tactic. Because AI models operate on tokens and not characters, it's pretty easy to confuse them when you go down to the character level. Even asking "how many letters in _" is tough for LLMs.

link

miohtama 1114 days ago

So not yet intelligent enough to pass Turing test, which is the goal of the test. I would not be worried about AI doomsday any time soon.

link

TheObviousOne 1114 days ago

Interesting tactics might going the other direction.

Asking to generate in a super human capabilities...

write a 65 pages of poem about X...

link

choeger 1114 days ago

Or just questions about three or four wildly different fields of science, sports and culture you happen to have more than a layman's understanding. If the answers are somewhat plausible, it's probably a model. Or your life partner.

link

8bitsrule 1113 days ago

Or ask two or three historical questions that very few humans would know anything about ... but a well-trained model would.

link

chasd00 1113 days ago

> "?siht daer uoy naC"

That took me a while to figure out and I’m a human… as far as a I know anyway.

link