Hacker News new | ask | show | jobs
by visarga 1097 days ago
We create representations from our sensorial data and values from our reward signals. We are part of a larger system and our actions are filtered and rewarded by the environment. We feed on this system to learn, basically we train on it.