|
|
|
|
|
by snthpy
33 days ago
|
|
> We found that high-quality constitutional documents combined with fictional stories portraying an aligned AI can reduce agentic misalignment by more than a factor of three despite being unrelated to the evaluation scenario. tl;dr Fairy Tales are an effective teaching tool in vivo et in silico |
|