| HN Mirror



	by mirsadm 259 days ago
	How are they not SOTA? They're all very similar, with ChatGPT being the worst (for my use case anyway): it does things like adding lambdas and random C++ function calls to my Vulkan shaders.

2 comments

oezi 259 days ago

Gemini 2.5 Pro is the most capable for my use case in PyTorch as well. Its large context and much better instruction following for code edits make a big difference.


hendersoon 259 days ago

Gemini 2.5 Pro is generally not competitive with GPT-5-medium or Sonnet 4.5.

But never fear, Gemini 3.0 is rumored to be coming out Tuesday.


kingstnap 259 days ago

The tweets I've seen from random people said Oct 9th, which is a Thursday. I suppose we will know when we know.


dingnuts 259 days ago

Based on what? LLM benchmarks are all bullshit, so this is based on... your gut?

Gemini outputs what I want about as regularly as the other bots do.

I'm so tired of the religious thinking around these models. Show me a measurement.