|
|
|
|
|
by ben_w
311 days ago
|
|
Right now the deeper problem is that nobody really knows how to reliably and deliberately make AI that's aligned with anything human-like. As in: it's basically a nice happy accident that LLMs are only sycophantic/fawning and don't normally (ahem, Grok) try to undermine us all like edgy internet trolls. If we could make them follow even exactly 6 (for sake of argument, no more, no less) of Anton LaVey's Eleven Satanic Rules of the Earth*, and do so reliably instead of the ethics equivalent of a shrug and "LGTM, merged", this would be a big development and make people a lot more comfortable about open models that can do decent work with chemistry or biology. * https://churchofsatan.com/eleven-rules-of-earth/ |
|