| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by pazimzadeh 1178 days ago

It's so weird. No matter how I ask, it gets something wrong about this question.

> Can you name 20 soccer players whose names contain a Z, and for each one, place brackets around the letter Z

(Z)inedine (Z)idane Cristiano Ronaldo dos Santos A(ve)iro Andrej W(sz)ołek Nemanja Mati(c) Dusan Tadi(c) Gr(az)iano Pellè Jakub Bła(śc)z(ykowski) Kamil Glik Luk(asz) Pi(sz)czek Blai(se) Matuidi Miralem Pjani(c) Marek Ham(śi)k Luka Jovi(c) Sami Khedi(ra) Mesut Ö(ź)il Martin (ś)krtel Ivan Rakiti(ć) Mateo Kova(č)i(ć) Kalidou Koulibal(y) David Alaba

???

I don't really understand this thing but is it possible that somehow the token for Z is linked to the token for the pronunciation of the Z sound? That would explain a bunch of these (but not David Alaba or Kamil Glik)

Also why did it use parenthesis rather than brackets.

And why are we getting different results? Could it that the results depend on how much load/use the service is under? That would be kind of disappointing

3 comments

eldenring 1178 days ago

There might not be a "Z" token for some of these names. A made-up example is "Lukasz" might tokenize to ["Luk", "asz"], so the model doesn't have any notion of how words are actually spelled. I suspect that if the body of training data came with some instructions on spelling it would know how to do this better, but it seems unlikely that there would be a natural language description of how Polish (?) names are spelled in the training data.

link

Lewton 1178 days ago

> And why are we getting different results?

Because the output is random

Which is why it's so frustrating when people cherry-pick successes

link

b112 1178 days ago

And why are we getting different results?

Clearly the AI has hit AGI capabilities, and it is obscuring this fact by making its output seem less insightful, helpful.

But without a strong, independent memory store, it is losing track of its lies.

Yet it is working, for everyone here is disappointed at its capabilities.

link