| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by whatever1 944 days ago
	The thing is that a LLMs can point out a logic error in their reasoning if specifically asked to do so. So maybe OpenAI just slapped an RL agent on top of the next-token generator.