Hacker News
by ben_w 311 days ago
Right now the deeper problem is that nobody really knows how to reliably and deliberately make AI that's aligned with anything human-like.

As in: it's basically a happy accident that LLMs are merely sycophantic/fawning and don't normally (ahem, Grok) try to undermine us all like edgy internet trolls.

If we could make them follow even exactly 6 (for the sake of argument: no more, no less) of Anton LaVey's Eleven Satanic Rules of the Earth*, and do so reliably rather than with the ethics equivalent of a shrug and "LGTM, merged", that would be a big development, and it would make people a lot more comfortable about open models that can do decent work with chemistry or biology.

* https://churchofsatan.com/eleven-rules-of-earth/