by bhaak 18 days ago
Who ensures it followed the specs?

The more context an LLM gets, the more likely it is to start ignoring instructions.

If the LLM runs a context compression, all bets are off. There's a reason Anthropic upped the context to 1M tokens: to reduce the chance of this happening.


> Who ensures it followed the specs?

The human. But only if you care about verification.

The human is missing from OP's description: "and it fills in the implementation". No human in sight.

You can't call it "engineering" if you don't care about verification.

If you build a bridge, the engineers aren't the ones doing the welding, operating the cranes, tightening bolts, digging holes, and whatnot.

They're the ones checking that work matches the plan.

Come on, now. The human writes the plan up front, which includes guidance on testing strategy, classes of tests, particular test cases to cover, etc. And just like normal, of course you don't just ship the code without doing manual verification, code review, auditing the test cases, and all the rest.

> Who ensures it followed the specs?

I mean, it's the same with building a bridge in the real world, right?

Someone has to check the work.