| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by odo1242 97 days ago
	Nope, I use GitHub Copilot (agentic mode) and I end up having to use the (more expensive) Claude model because ChatGPT never second-guesses me or even itself. Gemini is slightly worse though.

1 comments

For a less biased source, check out BSBench (where Claude dominates, and the highest rating GPT is 2x worse): https://petergpt.github.io/bullshit-benchmark/viewer/index.v...