| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by letitgo12345 1183 days ago
	Even the best Google models seem to be lagging for reasoning tasks vs OpenAI ones at the moment - see the graphs at https://github.com/suzgunmirac/BIG-Bench-Hard