Hacker News
by odo1242 88 days ago
For a less biased source, check out BSBench (where Claude dominates, and the highest-rated GPT model is 2x worse):
https://petergpt.github.io/bullshit-benchmark/viewer/index.v...