Hacker News new | ask | show | jobs
by odo1242 88 days ago
For a less biased source, check out BSBench (where Claude dominates, and the highest rating GPT is 2x worse): https://petergpt.github.io/bullshit-benchmark/viewer/index.v...