


	by conradkay 107 days ago
	Sonnet was pretty close to (or better than) Opus in a lot of benchmarks, so I don't think it's a big deal

1 comments

wat

maybe gp's use of the word "lots" is unwarranted

https://artificialanalysis.ai indicates that Sonnet 4.6 beats Opus 4.6 on GDPval-AA, Terminal-Bench Hard, AA Long Context Reasoning, and IFBench.

I was basing it off my recollection of this:

basically 9/13 are very close