Hacker News new | ask | show | jobs
by suddenlybananas 445 days ago
>They forced Claude to plan around a line ender that doesn't rhyme and it did. Claude didn't choose the word green, anthropic replaced the concept it was thinking ahead about with green and saw that the line changed accordingly.

I think the confusion here is from the extremely loaded word "concept" which doesn't really make sense here. At best, you can say that Claude planned that the next line would end with the word rabbit and that by replacing the internal representation of that word with another word lead the model to change.

1 comments

I wonder how many more years will pass, and how many more papers will Anthropic have to release, before people realize that yes, LLMs model concepts directly, separately from words used to name those concepts. This has been apparent for years now.

And at least in the case discussed here, this is even shown in the diagrams in the submission.

We'll all be living in a Dyson swarm around the sun as the AI eats the solar system around us and people will still be confident that it doesn't really think at all.