Hacker News new | ask | show | jobs
by titaniczero 1214 days ago
They are language models, it's what they do best. They "understand" (give more weight) to the correct relations between words (tokens) and try to predict the next token based on previous tokens. So when you feed it with instructions, for the next tokens the model will give more weight to the tokens related with those instructions.

On the other hand, they can't handle actual logic, reasoning, etc.

1 comments

>On the other hand, they can't handle actual logic, reasoning, etc.

what is "actual reasoning"?

Another HN user posted this (https://imgur.com/HOEnxYb) response to the prompt: "is throwing a rubber duck into the ocean a effective way to communicate with my brother who is on a pirate ship"

If you weren't told that this was "just" the result of a LLM plugged into a chat bot then surely you'd conclude that, especially given the short prompt, who/whatever generated the response demonstrated some type of logic or reasoning, right?

I get the impression that this tech is more than the sum of its parts.

>what is "actual reasoning"?

It can't play chess and by can't play I don't mean it is bad, I mean it doesn't follow the rules. Following chess rules requires little reasoning but it does need it. You look at the board, you choose a piece you want to move, then you look at the squares you are allowed to move to and then you check whether this results in a check. It is an illegal move to intentionally check yourself by moving the king so that it can be captured or to move a piece that prevents the capture of the king without removing the threat.

https://youtu.be/rSCNW1OCk_M

https://youtu.be/iWhlrkfJrCQ

The plot twist in the second video is just ridiculous. ChatGPT gets schizophrenia and doesn't even remember which color it plays.

I like to try giving logical puzzles to these chat bots to see what they say. I don't really expect them to do the reasoning to come to an answer, but it's interesting to see that they just make up new rules to satisfy some answer it thinks I want:

Prompt: Sally will go to the dance if James goes to the dance. Michael will go to the dance if Julie will go to the dance. James is sick and can't go to the dance. Will Michael go to the dance?

>No, Michael will not go to the dance. If James is sick and can't go, then Sally won't go either. And if Sally won't go, then Julie won't go, which means that the condition for Michael to go to the dance (that Julie will go) is not satisfied.

I think 'actual reasoning' would mean ChatGPT can solve (albeit contrived) puzzles like this. It can't, at least not reliably from my poking and prodding.

When prompted with:

> Sally will go to the dance if James goes to the dance. Michael will not go to the dance if Julie will go to the dance. James is sick and can't go to the dance. Julie and James always go to dances together. Will Michael go to the dance?

ChatGPT responds with:

> Michael will not go to the dance, since his attendance is contingent on Julie not going and James being unable to attend due to illness does not change that condition. Given that James and Julie always attend dances together, it is likely that Julie will also not go to the dance, meaning that Michael's condition for attending will be met, and he will not go to the dance.

It's really good at fibbing, but not so good at 'actual reasoning' (whatever that may be).

I got:

> We know that Sally will go to the dance if James goes to the dance, and James is sick and can't go to the dance. Therefore, we can conclude that Sally will not go to the dance.

> Since Michael will go to the dance if Julie will go to the dance, we don't have enough information about Julie's attendance to the dance. We only know that James is sick and can't go to the dance, so it's possible that Julie could still go to the dance. If Julie does go to the dance, then Michael will go as well. However, if Julie doesn't go, then Michael may or may not go to the dance, depending on his personal decision.

That strikes me as more nuanced than either of the other two. Worth mentioning I'm paying for the Plus subscription, though, and we just got a new "Turbo" model that answers faster. I think that model may be allowed a little more power as well, so the answer quality might be slightly better.

my 8 year old would fail at your puzzle. is an 8 year old capable of "actual reasoning"?

this[1] was an interesting read. Particularly the 'Emergent Prompting Strategies' and 'chain-of-thought prompting'.

I think we're a long way from sentient AI, but there is a real sense of "something" unusual and heretofore not achieved in computing. The responses to logic queries are a long way ahead of statistically driven word mashing.

[1] https://ai.googleblog.com/2022/11/characterizing-emergent-ph...

Ask ChatGPT to turn math word problems into executable JavaScript. Then evaluate the JavaScript. Suddenly ChatGPT is much better at math.
But that only proves the limitations are there.
Yes, LLMs are different. They are not reliable computers and computers are not reliable translators.

Perhaps pick the best tool for the job?

Or just flail around wildly and leave snarky comments on articles about bullshit generators. Your choice.