Hacker News new | ask | show | jobs
by RA_Fisher 12 days ago
In what ways? LM Arena has Opus 4.7 w/ 1567 -/+ 7 vs. 1505 -/+ 10 from GPT-5.5 Codex in code. I'm currently using both.

Admittedly my recent experience tilts Opus now 4.8, but you and others have my interest piqued re: GPT-5.5 Codex so I'm trying that more now.

1 comments

arena is not a good benchmark, it is very susceptible to sycophancy