Hacker News new | ask | show | jobs
by visarga 40 days ago
> I do not want to rediscover for the hundredth time that in fact all this time an agent took shortcuts for acceptance tests I rely upon and didn’t catch. Or once again get the agent to understand why and what I want it to do after its context got bloated and it start to drift completely.

100% agree, neither do I, but I see this as an opportunity to think "how can we gain trust in the outputs AI produced for us?"

Is it about tests, reviews, some methodology? Better observability? Formal specification? It's really interesting to think how you can relieve this pain. I think the answer to this question will show the path ahead for agentic coding.