| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by mrciffa 543 days ago
	Exactly! Uncertainty is critical to correctly evaluate LLM performance and we don't need reasoning models to spend thousands of tokens on simple questions