Hacker News new | ask | show | jobs
by jw1224 433 days ago
Are you sure about that? It correctly solves the puzzle.

Each step (though a little confusing to parse) does make sense in context.

2 comments

> Each step (though a little confusing to parse) does make sense in context.

The proposed intermediate answers don't make sense, and they're the actual puzzle. You must answer all the intermediates correctly, you can't skip levels.

The following ones clearly aren't correct (I think, I hadn't actually done that puzzle):

["the of nowhere"] -> middle; the model gets that right, but garbles the answer due to not correctly extracting the clue and having trailing garbage

[your [middle] one can get you grounded] -> finger; "room" as suggested by model makes no sense

[two-dimensional [finger] puppet projected on a wall] -> shadow; the model gets this right, but only by ignoring the clue completely, which means the justification is nonsense

[dial ([shadow]-based clock)] -> sun; the model says sundial, which then forces it to make an inane "a sundial represents the sun" argument at the next level.

[[red] spandex halloween costume, maybe] -> devil; the model doesn't give an answer word at all for this, but just says that "a red spandex costume is not a pitchfork"

The first line is already nonsense. The answer is obviously not "room".

Getting the correct final answer tells you nothing about the reasoning. The LLM will solve the puzzle even if you only pass it the sentence "the [] de Milo is discovered by a Greek []".