Hacker News new | ask | show | jobs
by pona-a 695 days ago
Well, what if it just got better at covering up human-presentable cases?

See this comment [0] on this very post, showing how it makes quite problematic mistakes on larger numbers still.

It's still improvement, but only in the way of imitation. It shows that while clever within their constraints, these models still don't have the capabilities to truly perform computation or "thought". Chain of thought can help, but you there are some things you cannot split into atomic tasks; if the very world model isn't that stellar, no amount of elucidation will compensate for the inaccurate representations within. (i.e. "How would person X react to Y?" If your theory of mind is poor, no amount of further subtasks will help you give a better prediction.)

[0] https://news.ycombinator.com/item?id=41092987

1 comments

For larger numbers it just needs to execute code. Most people also can't calculate such numbers in their head.

It shouldn't have to be able to do things it knows how to use code for. E.g. dumb thing slike how many Rs in a strawberry. It doesn't even see characters, so even if it was somehow possible, it couldn't count for sure.

It is like asking someone who only has ever seen hieroglyphs how many Rs are in a character by character version of strawberry.

Still, let's not anthropomorphize computational processes. It is a function approximate, which we'd expect to pick up on simple patterns like intersections or base10 arithmetic. When we see its predictions diverge from truth, that shouldn't be disregarded with a "just so" story, this is a sign we're pushing the architecture to its limits.