Hacker News new | ask | show | jobs
by anilgulecha 86 days ago
Claude models have made very good progress (see BS benchmark), and that probably explains why they're leading now. others will follow this precedent shortly, no doubt.

https://petergpt.github.io/bullshit-benchmark/viewer/index.v...