by hollosi 113 days ago
Enforcement is the real issue, not the specific red lines, regardless of what Anthropic claims and news outlets repeat.

Verification requires access to classified logs. These logs would attract spies from all over the world. Even if the logs in principle cover only "past actions", in practice past logs (from war games, for example) would compromise future strategy.

Since these manual audits are too risky, the only alternative is to hard-code limits into the AI. But are we ready to trust an AI to "judge" a mission and refuse to execute during a crisis?

Anthropic wanted technical enforcement; the Pentagon wanted trust.

It’s a choice between two bad options: an unaccountable military and an unreliable AI kill switch. They are both very dangerous, just in different ways.