| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by lovemenot 1165 days ago

This argument is imbalanced. Not necessarily wrong.

To follow scientific method, you should specify what would be meaningful according to your parameters: not merely parroting. If and when that is achieved by LLMs, instead of moving the goalposts, accept that something genuine has happened.

>> We will be flooded with content that will get harder and harder to prove wrong, but it will be wrong.

I mostly agree, especially about quantity. But since you wish to draw a line in the sand, define quality. Or truth.

2 comments

jarek83 1165 days ago

>> But since you wish to draw a line in the sand, define quality. Or truth.

I asked it about tourists attractions in a city I know. One of the items it listed was a Zoo. There is no Zoo there. It seemed like it just took the majority of the question into account at some point and ignored the city name itself and since most questions like that are about big cities that have zoos, it went ahead to suggest it, since it was among probable answers.

It will get better with more input and will get this kind of obvious questions right. But I believe it will never truly 'know' what it is talking about. And we will start to believe it does.

link

fl7305 1165 days ago

Exactly.

Or to put it differently: Is "GPT-4 is just a parrot" a falsifiable claim?

If so, what does the test look like that would falsify it?

link

jarek83 1165 days ago

I guess debugability will be its most desired feature soon. Let's see what copyrights will do with it.

link

fl7305 1164 days ago

What do "debuggability" or "copyrights" have to do with:

Is "GPT-4 is just a parrot" a falsifiable claim?

If so, what does the test look like that would falsify it?

Can you think of questions to test GPT-4s ability to reason that 90% of human 12-year olds will answer correctly, but where GPT-4 constantly struggles?

link

jarek83 1164 days ago

Look at my previous comment about a city without a Zoo.

link