| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by anilgulecha 133 days ago
	Claude models have made very good progress (see BS benchmark), and that probably explains why they're leading now. others will follow this precedent shortly, no doubt. https://petergpt.github.io/bullshit-benchmark/viewer/index.v...