Hacker News new | ask | show | jobs
by pydry 3 hours ago
Every time I read in some blog about an unproven technique which is profitable for token sellers I'm reminded of those overpriced restaurants that became "instagram popular" because the cool kids got paid a bundle of money to promote them.

In real life the only time I saw somebody try this "multiagentic coding" the results were...underwhelming.

2 comments

(Op here)

I genuinely think that multi-agent is a probable future to enable coding at the scale of a big corporation.

I agree and I did not see it work yet, but the trial were most likely on small scale where it is simply over engineering.

(Btw : I do not sell tokens. I I think distributed the work through agents in a plateform is a way to control costs by optimizing specialised agents)

I like the topic and I think orgs are struggling with the question:

What do our teams look like now?

But I have some big concerns with your approach here. This post is written like an authoritative summary but you admit it's not been seen working. Why is there so much untested conjecture presented as best practice here? If you had tested it you would realize this proposal is not possible in most orgs. Their "platform" will not be extensive enough to prevent misshaps by teams comprised of non engineers.

Because as a consultant I see that this model (TT) is a proven solution to the problems of cognitive load that prevent models to scale. It will require some adaptations indeed, but I trust that this is a missing piece in the integration of AI in organisations. But I get your point and this is the reason it share it on a personal blog and not on my company blog or in a more trustworthy source of truth.
Please take my comments as friendly suggestions in your pull request. I am not intending to shoot you down friend.

I don't think you should avoid sharing it in any forum. Like I said it seems like a reasonable idea but I would just suggest being very I suppose blunt in 2026 because people skim and won't read things thoroughly. I would preface the article with "this is not battle tested"

Lest someone be frustrated when their stream aligned team accidentally exposes your whole company's email addresses on their new web app that whoops they forgot put it behind a login.

And thank you for your comment, no offense taken. I post here to get valuable feedback like yours.
It would be nice to see some metrics. I think the missing layer here is evaluation. If agents are going to produce applications, the platform needs not only guardrails, but public-ish evidence that those guardrails actually catch failures
I fully agree
In real life only a minority of teams achieved good results according to Circle CI report.

And there is one notable anecdotal source proving that it is possible - https://ideas.fin.ai/p/2x-nine-months-later