| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by Avamander 80 days ago
	I feel like the maximum effort mode kind-of wraps around and starts becoming "desperate" to the extent of lazy or a monkey's paw, similar to how lower effort modes or a poor prompt.

2 comments

svnt 80 days ago

I’m going in circles. Let me take a step back and try something completely different. The answer is a clean refactor.

Wait, the simplest fix is the same hack I tried 45 minutes ago but in a different context. Let me just try that.

Wait,

link

dinobones 80 days ago

Wait, the linter re-ordered the file. Let me restore it to the previous state.

whisper: There is no linter.

link

jen729w 80 days ago

Those test failures are pre-existing. We're all done!

link

xnorswap 80 days ago

Wait, I should check if they pre-exist on master.

    < 1,000 prompts for compound cd && git commands that can't be safely auto-accepted >

link

richardjennings 80 days ago

I think over-thinking is only solved by thinking more, not less. This is only viable once some intelligence threshold is reached, which I think Anthropic has borderline achieved.

link

thesz 79 days ago

  > I think over-thinking is only solved by thinking more, not less.

Despite "thinking" tokens being determined by the preceding tokens, they still are taken from some probability distribution, just a complex one. This means that at each token selection step there is a probability P_e of an error, of selecting a wrong token.

These errors compound exponentially: the probability of not selecting wrong token for N steps is 1-(1-P_e)^N.

The shorter "thinking" is, the less is the probability of it going astray.

link

richardjennings 79 days ago

> The shorter "thinking" is, the less is the probability of it going astray

As long as the error introduced by more steps is less than the compounding error of sub-optimal token sampling, I would expect a better result.

I think your choice of "wrong" is extreme, suggesting such a token can catastrophically spoil the result. The modern reality is more that the model is able to recover.

link