by sweezyjeezy 422 days ago
o4-mini got this right 4 times out of 4.
1 comment

o4 got this wrong multiple times. Claude 3.7 got it right the first time.