Hacker News new | ask | show | jobs
by Kim_Bruning 503 days ago
Claude 3.5 Sonnet did rather well.

I get the impression it often does as well or better than o1 on many tasks, despite not being a reasoning model.