| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by bjourne 4 days ago
	LLMs work by generating the most likely continuation to a prompt. But they can also generate multiple likely continuations. This create multiple branches which in turn can generate even more branches. The LLM can then evaluate the branches, prune the unpromising ones, and merge the best ones. More branches means more tokens, means more effort.

1 comments

simianwords 4 days ago

this has nothing to do with the thinking effort however

link

bjourne 4 days ago

Yes, it does. Breadth of search is exactly what the effort setting controls.

link

pyentropy 4 days ago

LLM-judge/parallel branching ≠ multi-token prediction ≠ reasoning effort.

See https://developers.openai.com/cookbook/articles/openai-harmo... and src/openai/types/shared/reasoning_effort.py

link

bjourne 3 days ago

[flagged]

link

simianwords 3 days ago

No it doesn't and lets not call people names. You can verify this using ChatGPT or anything else. You are mistaken and there are no "branches" happening.

link

bjourne 2 days ago

[flagged]

link

FergusArgyll 9 hours ago

I think you may be confusing the openai "pro" series models with thinking. Thos are rumored to be multi "branched"

link