Hacker News new | ask | show | jobs
by unchocked 37 days ago
This lowers p(doom) for me.

It makes sense that reinforcement learning on reasoning about coherent principles should bias toward principled action in real situations.

Probably also illuminates moral interpretability.