Hacker News
by Moosdijk 185 days ago
I meant either going straight to the likeliest output (flash), or iteratively generating multiple outputs and choosing the best one (thinking/pro).
1 comment
nl 185 days ago
That's not how these models work.
Thinking models produce thinking tokens to reason out the answer.
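To make the distinction concrete, here is a toy Python sketch, not a description of how any particular model is built. Everything in it is a hypothetical stand-in: `scripted` replays a fixed token list in place of a real model, the `<think>`/`</think>` delimiters are an assumed convention, and `score` is a made-up selector. It contrasts one generation pass that emits thinking tokens before the answer with the "generate several, pick the best" idea from the parent comment.

```python
from typing import Callable, List, Tuple

Token = str
NextToken = Callable[[List[Token]], Token]


def scripted(script: List[Token]) -> NextToken:
    """Hypothetical toy 'model': replays a fixed script, then emits <eos>."""
    it = iter(script)
    return lambda context: next(it, "<eos>")


def generate(next_token: NextToken, prompt: List[Token], max_tokens: int = 32) -> List[Token]:
    """One autoregressive pass: each new token is appended to the context."""
    context = list(prompt)
    output: List[Token] = []
    for _ in range(max_tokens):
        tok = next_token(context)
        if tok == "<eos>":
            break
        context.append(tok)  # later tokens are conditioned on earlier ones
        output.append(tok)
    return output


def split_thinking(tokens: List[Token]) -> Tuple[List[Token], List[Token]]:
    """Separate thinking tokens from answer tokens (assumed <think> delimiters)."""
    if "<think>" in tokens and "</think>" in tokens:
        i, j = tokens.index("<think>"), tokens.index("</think>")
        return tokens[i + 1:j], tokens[j + 1:]
    return [], tokens


def best_of_n(samplers: List[NextToken], prompt: List[Token],
              score: Callable[[List[Token]], float]) -> List[Token]:
    """Best-of-N: generate N independent outputs, keep the highest-scoring one."""
    candidates = [generate(s, prompt) for s in samplers]
    return max(candidates, key=score)


prompt = ["What", "is", "17", "*", "3", "?"]

# Thinking style: a single pass whose answer follows its own reasoning tokens.
thinking_model = scripted(["<think>", "17*3", "=", "51", "</think>", "51"])
thinking, answer = split_thinking(generate(thinking_model, prompt))

# Best-of-N style: several separate passes plus an external selection step.
samplers = [scripted(["41"]), scripted(["51"]), scripted(["52"])]
picked = best_of_n(samplers, prompt, score=lambda toks: 1.0 if toks == ["51"] else 0.0)
```

In the first case there is only one output and no selection step; the reasoning is part of the generated sequence itself. In the second, nothing is reasoned out within a single output, and quality comes from choosing among candidates.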