Hacker News new | ask | show | jobs
by TacticalCoder 14 days ago
Google needs to catch up on what? Devs mindshare? The latest Opus 4.8 carefully selected benchmarks made sure to pick Gemini 3.1 Pro and not Gemini 3.5 Flash: 3.5 Flash is beating Opus 4.8 on several of the benchmarks Anthropic posted but simply was ignored.

I don't think SOTA-wise Google has a lot of catch up to do.

1 comments

Gemini 3.5 Flash is not good at coding in practice. Gemini 3.1 Pro too, in particular is known to be bad at tool calls. Many companies would love to have alternatives to Claude Code (as it's a significant risk to depend on one vendor), so far most of the buzz is about moving to Codex but much fewer talk about moving to Gemini. All these benchmarks are not very informative, the Chinese labs do better on these benchmarks than in practice, for example.