| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by m3kw9 49 days ago
	FrontierCode is likely paid for by anthropic.

2 comments

lanthissa 49 days ago

did they not pay them enough to get good ratings on the other 3 models?

whats the logic in claiming its a borked metric when everything listed is an anthropic model.

link

Narretz 49 days ago

There a few benchmarks out there where all existing models have abysmal scores. So it's not actually a problem if Antrophic's older models are bad, especially if the jump to the newest model is huge, and the competition is also way below it.

link

reasonableklout 49 days ago

Huh? It's a benchmark by Cognition which (1) is building their own models and (2) offers all providers and thus has an incentive to avoid hyping up any one too much.

link

jstummbillig 49 days ago

But you can just say shit now. Tokens might not be too cheap to meter but saying shit increasingly is.

link