by Jang-woo 84 days ago

That's a good point.

I think you're right that at the model level, competition pushes toward "always say yes."

What I'm wondering is whether control needs to exist at a different layer: not in the model itself, but in the system that decides whether actions are allowed to execute.

In other words, even if a model is willing to say "yes," the system using it might still need to decide whether execution is permitted.

Otherwise, it feels like we're relying entirely on model behavior for safety, which seems fragile in competitive environments.
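
To make that concrete, here's a rough sketch of what I mean by a separate execution layer. Everything in it is made up for illustration (`ProposedAction`, `ALLOWED_ACTIONS`, `execute_if_permitted`, the handler names); it isn't any real framework's API. The only point is that the model's "yes" is treated as a proposal, and a separate piece of code decides whether it runs.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict


@dataclass
class ProposedAction:
    """What the model asks for. Hypothetical shape, for illustration only."""
    name: str
    args: Dict[str, Any]


# Hypothetical policy owned by the system, not the model.
# The model can propose anything; only these names may execute.
ALLOWED_ACTIONS = {"read_file", "search"}


def is_permitted(action: ProposedAction) -> bool:
    """Policy check that never consults the model's own judgment."""
    return action.name in ALLOWED_ACTIONS


def execute_if_permitted(
    action: ProposedAction,
    handlers: Dict[str, Callable[..., Any]],
) -> Any:
    """Run the action only if the policy allows it and a handler exists."""
    if not is_permitted(action):
        return f"blocked: {action.name}"
    handler = handlers.get(action.name)
    if handler is None:
        return f"blocked: {action.name}"
    return handler(**action.args)
```

A real policy would obviously look at more than the action name (arguments, caller, context), but even this toy version means a model that agrees to everything can't execute anything outside the allowlist.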