| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by taf2 49 days ago
	I’m waiting to see results on deepswe - that benchmark really seemed accurate for opus and gpt 5.5…