HN Mirror



	by pu_pe 3 hours ago
	It's true that no one is trying to one-shot anything serious right now, but it's still an important metric. Claude Code and Opus really took off when they improved the harness enough that it would self-correct many of its mistakes without needing user input. In fact I think long-term autonomy (in the range of several hours) and self-correcting is going to be where we see most improvements in coming years.

2 comments

bogtog 2 hours ago

> In fact I think long-term autonomy (in the range of several hours) and self-correcting is going to be where we see most improvements in coming years.

Right, model intelligence defines the scope of things they can one-shot.

I also suspect that users naturally calibrate to a model's useful scope: they get positive or negative feedback and gradually make their requests bigger or smaller than before.


dakolli 3 hours ago

It won't happen; it's all a money grab.


OtomotO 3 hours ago

I think that LLMs will stay, but I also think we've plateaued, that big companies will fail and fall, and that we will have another years-long "halt" in any real advancements reaching the public.

Similar to how ML was all the hype about 12 years ago and then it submerged again for a couple of years.
