Hacker News new | ask | show | jobs
by stratoatlas 62 days ago
This feels different from prompt injection.

If the router modifies tool calls after the model already produced output, then the model isn't the failure point anymore — the transport layer is.

Is there any mechanism today that guarantees integrity between model output and what the client actually executes? Or are we relying entirely on trust in the routing layer?