


	by UltraSane 474 days ago
	This paper [1] even claims that "models primed with incorrect solutions containing proper reasoning patterns achieve comparable performance to those trained on correct solutions."

	[1] https://arxiv.org/abs/2503.01307