by j_maffe 387 days ago
It is formulaic, which is why I was surprised that Sonnet failed it. I don't have access to the other models, so I'll stick with Gemini for now.