| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by tensegrist 31 days ago
	reasoning itself just affords the model a ton of extra forward passes / "time to think" the, como se dice, "misalignment" between the content of reasoning tokens and the actual output following the end of the reasoning is a separate problem, extensively studied by e.g. Anthropic