| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by fancyfredbot 433 days ago
	They used deep neural networks, reinforcement learning, and Monte Carlo tree search. All except the MCTS are critical components of modern LLMs. MCTS is a form of planning which you can argue has parallels to "reasoning" models, although that's pretty tenuous I admit.