by czl 235 days ago

> when you have some supremely intelligent agent acting on the world, even a small misalignment may end up in catastrophe

Why not frame this as a challenge for AI? When the intelligence gap between a fully aligned system and a not-yet-aligned one becomes very large, control naturally becomes difficult.

However, recursive improvement — where alignment mechanisms improve alongside intelligence itself — might prevent that gap from widening too much. In other words, perhaps the key is ensuring that alignment scales recursively with capability.