Hacker News
by
XCSme
39 days ago
GPT 5.5 is quite a big leap; it's a lot better than Opus 4.7 for agentic coding.
2 comments
energy123
39 days ago
Arena only allows very small context sizes, so it's a noisy benchmark for what we care about IRL.
mettamage
39 days ago
Better in what ways? I'm just curious about your experience.
XCSme
39 days ago
Consistency, not making mistakes.
mettamage
39 days ago
Ahh... that is indeed an issue I have with Claude. I'll check it out!