Hacker News new | ask | show | jobs
by jtefera 529 days ago
The paper mentions that on several occasions the LLM will provide a correct answer but will either take big jumps without justifying them or will take illogical steps but end up with the right solution at the end. Did you check for that?
1 comments

No, I don't know enough math to test the logic, only the check questions against their expected answers in https://anonymous.4open.science/r/putnam-axiom-B57C/data/Put...
Putnam problems need to actually be graded, often the answer itself is trivial.