Hacker News new | ask | show | jobs
by diegof79 250 days ago
I just tried Opus 4.1=Pass (after a self correction in its answer), Gemini 2.5 Flash=Pass (surprised that it gave the correct answer immediately)