| HN Mirror

Y	Hacker News new \| ask \| show \| jobs


	by peab 35 days ago
	One of the bottlenecks right now is testing. The LLMs suck at testing still, so the feedback cycle still requires human input

1 comments

imdsm 35 days ago

That's because testing is a layer above, and that requires effort from people, and people don't want to put in the effort or lack the skills to properly test with them. I think LLMs are superb at testing, but I have a lot of machinery in front of them to enable it

link

peab 35 days ago

They're not good at testing things that require a high visual frame rate (i.e animations), high fidelity (small details), or are out of the normal distribution of things they've seen in the past (without comprehensive prompting)

link

ponector 35 days ago

LLMs are superb at testing easy things. Small website, simple lib, etc.

Same way as LLMs cannot code anything complex, they cannot test complex scenarios.

link

falcor84 35 days ago

Where is the "cannot" coming from?

The only underlying rule that I see is: the more complex a task is, the lower the probability of success. That's it. And it's the same exact rule that applies to humans. The difference though is that we're seeing an exponential growth in AI capabilities, which are then rapidly disseminated on a scale of months, and we don't see such capability increases in humans.

As I see it, anyone whose mental model is built on what AI can or cannot do is going to have a very bumpy decade.

link

ponector 35 days ago

Where is the "can" coming from? We've seen failed attempts to persuade LLMs to crated a browser, a compiler.

link

theshrike79 32 days ago

"A browser" is complex AND complicated. Billion dollar companies build those for a reason.

It's like saying nobody should build a car from scratch because they'll never win the F1 championship :D

link

satisfice 35 days ago

AI cannot take responsibility. It will not indemnify you. My decade is going just fine.

link

falcor84 35 days ago

I don't get it. How is knowing that AI itself will not take responsibility or that it will not indemnify its users make your decade any better? Is your line of reasoning that using AI will very likely prove to be a bad idea in the long term and thus perfectly rational people with the same priors as yours will abstain from training or utilizing AI?

link

satisfice 35 days ago

I am saying that AI cannot take responsibility. My line of reasoning assumes that you understand that responsibility/accountability is a crucial element in offering products and services in society.

Although software companies like to disclaim all warranties to the extent the law allows, neither the market nor the FTC will allow you to breach contracts, make outrageous false claims, or offer products that destroy life and property.

AI cannot and will not behave responsibly. I’m not opposed to AI, but I use it as a tool that I supervise. I review and test what it produces. People and companies that fail to do that will ruin their reputations, I predict.

link

throwaway2037 35 days ago

    > but I have a lot of machinery in front of them to enable it

Can you share some details?

link