Hacker News
by snthpy 33 days ago
> We found that high-quality constitutional documents combined with fictional stories portraying an aligned AI can reduce agentic misalignment by more than a factor of three despite being unrelated to the evaluation scenario.

tl;dr: Fairy tales are an effective teaching tool, in vivo et in silico.