Hacker News
by taf2 2 days ago
I’m waiting to see results on deepswe; that benchmark really seemed accurate for Opus and GPT 5.5…