by pakl 3467 days ago
If the shadow is predictable, then the unsupervised (self-supervised) part of our Predictive model will not have much trouble learning to deal with it. But you're right: if it did have trouble, there would be good reason for the system to continue exploring that type of event.

One way to do this is to link PVM's prediction error output to an "instinct" that directs it towards the lower-confidence events you mentioned. This could be as simple as orienting the camera towards those events or, in a robot, producing actions that lead to those events again.
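
To make the camera-orienting idea concrete, here is a rough sketch of what such an "instinct" could look like. It is not how PVM does it: the error map layout (a 2D grid of per-pixel prediction errors), the block size, and the function names `most_surprising_cell` and `gaze_offset` are all assumptions made up for illustration.

```python
def most_surprising_cell(error_map, cell):
    """Find the cell x cell block with the highest mean prediction error.

    error_map is assumed to be a list of rows of non-negative floats,
    at least cell x cell in size. Returns ((row, col), mean_error),
    where (row, col) is the block's top-left corner.
    """
    rows, cols = len(error_map), len(error_map[0])
    best, best_err = None, float("-inf")
    for r in range(0, rows - cell + 1, cell):
        for c in range(0, cols - cell + 1, cell):
            total = sum(error_map[r + i][c + j]
                        for i in range(cell) for j in range(cell))
            mean = total / (cell * cell)
            if mean > best_err:
                best, best_err = (r, c), mean
    return best, best_err


def gaze_offset(error_map, cell):
    """Offset (dx, dy) from the image center to the most surprising block.

    A hypothetical camera controller could turn this into a pan/tilt command.
    """
    (r, c), _ = most_surprising_cell(error_map, cell)
    rows, cols = len(error_map), len(error_map[0])
    return (c + cell / 2 - cols / 2, r + cell / 2 - rows / 2)


# Toy 4x4 error map: the model predicts everything well except the
# bottom-right corner.
toy_errors = [
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 1.0],
    [0.0, 0.0, 1.0, 1.0],
]
offset = gaze_offset(toy_errors, cell=2)
```

With the toy map, the offset points right and down, towards the poorly predicted corner. A real system would presumably smooth the error over time so that pure noise does not keep attracting the gaze.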

Note that this is not necessarily reinforcement learning, but it relates to some ideas from there, such as novelty being rewarding.