| HN Mirror

Y	Hacker News new \| ask \| show \| jobs

by bhaak 18 days ago

Who ensures it followed the specs?

The more context an LLM gets, the more likely it will start to ignore instructions.

If the LLM runs a context compression, all bets are off. There's a reason Anthropic upped the context to 1M tokens to reduce the chance of this from happening.

2 comments

not_a_bot_4sho 18 days ago

> Who ensures it followed the specs?

The human. But only if you care about verification.

link

bhaak 18 days ago

The human is missing form OP's description. "and it fills in the implementation". No human in sight.

You can't call it "engineering" if you don't care about verification.

link

ceejayoz 18 days ago

If you build a bridge, the engineers aren't the one doing the welding and crane operation and bolts and digging holes and whatnot.

They're the ones checking that work matches the plan.

link

montroser 18 days ago

Come on, now. The human writes the plan up front, which includes guidance on testing strategy, classes of tests, particular test cases to cover, etc. And just like normal, of course you don't just ship the code without doing manual verification, code review, auditing the test cases, and all the rest.

link

ceejayoz 18 days ago

> Who ensures it followed the specs?

I mean, it's the same with building a bridge in the real world, right?

Someone has to check the work.

link