| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by number6 92 days ago
	But can it count the R's in strawberry?

2 comments

Paradigma11 92 days ago

That question is equivalent to asking a human to add the wavelengths of those two colors and divide it by 3.

link

snovv_crash 92 days ago

Unless you're aware of hyperspectral image adapters for LLMs they aren't capable of that either.

link

szszrk 92 days ago

Unfair - human beats AI in this comparison, as human will instantly answer "I don't know" instead of yelling a random number.

Or at best "I don't know, but maybe I can find out" and proceed to finding out/ But he is unlikely to shout "6" because he heard this number once when someone talked about light.

link

koliber 92 days ago

> human will instantly answer "I don't know" instead of yelling a random number.

Seems that you never worked with Accenture consultants?

link

szszrk 92 days ago

Fair.

Yet this can be filtered with fixed rules, like "output produced by corporate structures is untrusted random data".

link

thegabriele 92 days ago

Why is that?

link

Paradigma11 92 days ago

Because LLMs dont have a textual representation of any text they consume. Its just vectors to them. Which is why they are so good at ignoring typos, the vector distance is so small it makes no difference to them.

link

Aditya_Garg 92 days ago

yes its ridiculously good at stuff like that now. I dare you to try and trick it.

link

frizlab 92 days ago

https://news.ycombinator.com/item?id=47495568

link

thedatamonger 92 days ago

what bothers me is not that this issue will certainly disappear now that it has been identified, but that that we have yet to identify the category of these "stupid" bugs ...

link

sigmoid10 92 days ago

We already know exactly what causes these bugs. They are not a fundamental problem of LLMs, they are a problem of tokenizers. The actual model simply doesn't get to see the same text that you see. It can only infer this stuff from related info it was trained on. It's as if someone asked you how many 1s there are in the binary representation of this text. You'd also need to convert it first to think it through, or use some external tool, even though your computer never saw anything else.

link

Measter 92 days ago

> It's as if someone asked you how many 1s there are in the binary representation of this text.

I'm actually kinda pleased with how close I guessed! I estimated 4 set bits per character, which with 491 characters in your post (including spaces) comes to 1964.

Then I ran your message through a program to get the actual number, and turns out it has 1800 exactly.

link

sigmoid10 90 days ago

>I estimated 4 set bits per character, which with 491 characters in your post (including spaces) comes to 1964

And that's exactly the kind of reasoning an LLM does when you ask it about characters in a word. It doesn't come from the word, it comes from other heuristics it picked up during training.

link

datsci_est_2015 92 days ago

Okay but, genuinely not an expert on the latest with LLMs, but isn’t tokenization an inherent part of LLM construction? Kind of like support vectors in SVMs, or nodes in neural networks? Once we remove tokenization from the equation, aren’t we no longer talking about LLMs?

link

fenomas 92 days ago

It's not a side effect of tokenization per se, but of the tokenizers people use in actual practice. If somebody really wanted an LLM that can flawlessly count letters in words, they could train one with a naive tokenizer (like just ascii characters). But the resulting model would be very bad (for its size) at language or reasoning tasks.

Basically it's an engineering tradeoff. There is more demand for LLMs that can solve open math problems, but can't count the Rs in strawberry, than there is for models that can count letters but are bad at everything else.

link