| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Moosdijk 185 days ago
	I meant going to the likeliest output (flash) or (iteratively) generating multiple outputs and (iteratively) choosing the best one (thinking/pro)

1 comments

That's not how these models work.

Thinking models produce thinking tokens to reason out the answer.