Hacker News new | ask | show | jobs
by Moosdijk 185 days ago
I meant going to the likeliest output (flash) or (iteratively) generating multiple outputs and (iteratively) choosing the best one (thinking/pro)
1 comments

That's not how these models work.

Thinking models produce thinking tokens to reason out the answer.