Y
Hacker News
new
|
ask
|
show
|
jobs
by
odo1242
97 days ago
Nope, I use GitHub Copilot (agentic mode) and I end up having to use the (more expensive) Claude model because ChatGPT never second-guesses me or even itself. Gemini is slightly worse though.
1 comments
odo1242
97 days ago
For a less biased source, check out BSBench (where Claude dominates, and the highest rating GPT is 2x worse):
https://petergpt.github.io/bullshit-benchmark/viewer/index.v...
link