| HN Mirror



	by jandrese 479 days ago
	Yeah, that's true in many fields with these AI agents. They demo well, but when you put them to actual work they fall right on their face. Even worse, the harder the task you set for them the more they lie to you. It's like hiring a junior dev from one of those highly regimented societies where it's more important to save face than to get the job done.

2 comments

Xelynega 479 days ago

It's almost as if they're not trying to market to the people actually using the products, but trying to convince investors of features that don't exist


alfalfasprout 479 days ago

Yep it's "full self driving in 1 year" all over again.


ilrwbwrkhv 479 days ago

It's the good old Elon Musk playbook spread out across the industry.


brookst 479 days ago

Someone should coin a term for this very new phenomenon. Maybe “vaporware”?


aprilthird2021 479 days ago

Your last sentence feels kind of spot on. The lack of transparency around confidence in the answer makes it hard to use (and I know it would not be simple to add such a thing)


hackernewds 479 days ago

Sounds like a skill issue, to be honest. You could probably tell the assistant to just ask you questions when information is missing instead.


dimitri-vs 479 days ago

Have you actually tried this? What happens is it will very often ask you questions at irrelevant times so you start ignoring the questions and it becomes wasted space.

Even OpenAI hasn't figured it out, because their Deep Research always asks questions before starting the search.


ryoshu 479 days ago

Programming is easy. Asking the right question is hard.

People don't know what questions to ask.


aprilthird2021 479 days ago

But it doesn't know when information is missing
