| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by spiralk 639 days ago
	The Aider leaderboards seem like a good practical test of coding usefulness: https://aider.chat/docs/leaderboards/. I haven't tried Cursor personally but I am finding Aider with Sonnet more useful that Github Copilot and its nice to be able to pick any model API. Eventually even a local model may be viable. This new Gemini model does not rank very high unfortunately.

1 comments

kendallchuang 639 days ago

Thanks for the link. That's unfortunate, though perhaps the benchmarks will be updated after this latest Gemini release. Cursor with Sonnet is great, I'll have to give Aider a try as well.

link

KaoruAoiShiho 639 days ago

It's on the leaderboard, it's tied with qwen 2.5 72b and far below SOTA of o1, claude sonnet, and deepseek. (also below very old models like gpt-4-0314 lol)

link

spiralk 639 days ago

It is updated actually, gemini-1.5-pro-002 is this new model.

link

kendallchuang 639 days ago

That was fast, I missed it!

link