Hacker News new | ask | show | jobs
by js8 849 days ago
Well, in reinforcement learning, wrong outputs are inherently painful for the AI. If the system expects getting negative feedback regardless what it does, I would say it is suffering. I would also define torture as giving it negative feedback for any possible output.