HN Mirror


	by RA_Fisher 59 days ago
	In what ways? LM Arena has Opus 4.7 at 1567 ± 7 vs. 1505 ± 10 for GPT-5.5 Codex in code. I'm currently using both. Admittedly my recent experience tilts toward Opus (now 4.8), but you and others have piqued my interest in GPT-5.5 Codex, so I'm trying it more now.
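
	For a rough sense of what the two quoted Arena scores imply, here is a minimal sketch. It assumes the scores sit on an Elo-like scale (logistic, base 10, 400-point divisor) and that the ± values are symmetric interval half-widths; neither assumption is stated in the comment, and the function names are made up for illustration.

```python
def intervals_overlap(a, a_err, b, b_err):
    """True if [a - a_err, a + a_err] and [b - b_err, b + b_err] intersect."""
    return (a - a_err) <= (b + b_err) and (b - b_err) <= (a + a_err)


def elo_win_probability(score_a, score_b, scale=400.0):
    """Expected head-to-head win rate of A over B under an Elo-style model.

    Assumption: the leaderboard uses the conventional 400-point logistic scale.
    """
    return 1.0 / (1.0 + 10 ** ((score_b - score_a) / scale))


# Scores and half-widths as quoted in the comment.
opus, opus_err = 1567, 7
codex, codex_err = 1505, 10

gap = opus - codex
overlap = intervals_overlap(opus, opus_err, codex, codex_err)
p_opus = elo_win_probability(opus, codex)

print(f"gap = {gap} points, intervals overlap: {overlap}")
print(f"implied Opus win rate vs. Codex: {p_opus:.2f}")
```

	Under those assumptions the 62-point gap is larger than the combined interval widths, and it maps to an expected head-to-head preference rate of roughly 0.59 for Opus. That says nothing about whether the votes themselves measure coding quality.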

1 comment

Arena is not a good benchmark; it is very susceptible to sycophancy.