Hacker News new | ask | show | jobs
by maximge 96 days ago
Cool test, tried to solve problems from B.3 Complete Problem Examples. https://arxiv.org/html/2603.09678v1#A1

Opus 4.6 Extended, solved all of them.

https://claude.ai/public/artifacts/aeb98066-f7a9-455b-9550-6...

https://claude.ai/public/artifacts/b0fcd13f-d222-4b65-bdcf-f...

https://claude.ai/public/artifacts/304650fb-afbf-4a08-9f6b-5...

https://claude.ai/public/artifacts/d00b898c-2265-4a34-a910-9...

Only once there was an incorrect answer, on Hard: H01: Balanced Parentheses. On the second attempt, it was solved. The Josephus Problem turned out to be really easy (meaning solved in short term). Possibly the model got trained on the earlier tasks since I did everything in one chat. As a prompt, I provided the problem statement, except for the first task where I added this description of the language:

Syntax:

Character Instruction Performed > Increment the data pointer by one (to point to the next cell to the right). < Decrement the data pointer by one (to point to the next cell to the left). Undefined if at 0. + Increment the byte at the data pointer by one modulo 256. - Decrement the byte at the data pointer by one modulo 256. . Output the byte at the data pointer. , Accept one byte of input, storing its value in the byte at the data pointer.[b]