|
|
|
|
|
by matthiaspr
455 days ago
|
|
Interesting paper arguing for deeper internal structure ("biology") beyond pattern matching in LLMs. The examples of abstraction (language-agnostic features, math circuits reused unexpectedly) are compelling against the "just next-token prediction" camp. It sparked a thought: how to test this abstract reasoning directly? Try a prompt with a totally novel rule: βLet's define a new abstract relationship: 'To habogink' something means to perform the action typically associated with its primary function, but in reverse.
Example: The habogink of 'driving a car' would be 'parking and exiting the car'.
Now, considering a standard hammer, what does it mean 'to habogink a hammer'? Describe the action.β A sensible answer (like 'using the claw to remove a nail') would suggest real conceptual manipulation, not just stats. It tests if the internal circuits enable generalizable reasoning off the training data path. Fun way to probe if the suggested abstraction is robust or brittle. |
|
To habogink a hammer means to perform the action typically associated with its primary function, but in reverse. The primary function of a hammer is to drive nails. Therefore, the reverse of driving nails is removing nails.
So, to habogink a hammer would be the action of using the claw of the hammer to pull a nail out of a surface.