Hacker News new | ask | show | jobs
by sanxiyn 2399 days ago
Constrained RL is a way to say "thou shalt not murder", instead of saying "murder is utility -10000".