| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by kazinator 411 days ago
	That's totally cribbed from some discussion hat occurred in its training.

4 comments

Nevermark 411 days ago

As apposed to humans who all derive the physics of heat transfer independently when given a question like this?

Not picking on you - this brings up something we could all get better at:

There should be a "First Rule of Critiquing Models": Define a baseline system to compare performance against. When in doubt, or for general critiques of models, compare to real world random human performance.

Without a real practical baseline to compare with, its to easy to fall into subjective or unrealistic judgements.

"Second Rule": Avoid selectively biasing judgements by down selecting performance dimensions. For instance, don't ignore difference in response times, grammatical coherence, clarity of communication, and other qualitative and quantitative differences. Lack of comprehensive performance dimension coverage is like comparing runtimes of runners, without taking into account differences in terrain, length of race, altitude, temperature, etc.

It is very easy to critique. It is harder to critique in a way that sheds light.

link

selcuka 409 days ago

> As apposed to humans who all derive the physics of heat transfer independently when given a question like this?

Isn't that the difference between learning and memorizing, though? If you were taught Newton's Law of Cooling using this example and truly learned it, you could apply it to other problems as well. But if you only memorized it, you might be able to recite it when asked the same question, yet still be unable to apply it to anything else.

link

accrual 410 days ago

> It is very easy to critique. It is harder to critique in a way that sheds light.

Well said. This is the sort of ethos I admire and aspire to on HN.

link

mhh__ 411 days ago

So is my knowledge of newtons law of cooling

link

kazinator 411 days ago

If an LLM has only that knowledge and nothing else (pieces of text saying that heat transfer is proportional to some function of the temp difference) such that is not trained on any texts that give problems and solutions in this area, it will not work this out, since it has nothing to generate tokens from.

Also, your knowledge doesn't come from anywhere near having scanned terabytes of text, which would take you multiple lifetimes of full time work.

link

mhh__ 410 days ago

We get way more info than llms do, just not solely from text

link

suddenlybananas 410 days ago

You have not read every accessible piece of text in existence.

link

mhh__ 410 days ago

There is more to life than just text e.g. this is part of lecun argument against LLMs

link

suddenlybananas 410 days ago

Lecun's argument is based off a bad interpretation of how data is processed by the optic nerve, we don't receive that much raw data.

What we do have, is billions of years of evolution that has given a lot of innate knowledge which means we are radically more capable than LLMs despite having little data.

link

kazinator 410 days ago

There is more to text than just predicting tokens based on a vast volume of text.

There isn't an argument "against LLMs" as such; the argumentation is more oriented against the hype and incessant promotion of AI.

link

fph 410 days ago

This exact problem was in Martin Gardner's column for Scientific American in the 1970s. There are surely references all over the internet.

link

jonplackett 410 days ago

If it was just ‘in the training data’ they’d all get it right.

But they don’t.

link

kazinator 410 days ago

I don't think that can be postulated as a law, because they are a kind of lossy compression. Different lossy compressions will lose different details.

link