Hacker News
by dchichkov 508 days ago
Sorry, but this was ChatGPT/o1 with access to code execution (Python), and it spent almost 4 minutes reasoning. It ran a few checks with smaller numbers, all of which failed, and then went on to draw the wrong conclusion anyway (with high confidence).
1 comment

Of course it failed. Tell it to write a program.