Hacker News new | ask | show | jobs
by singularity2001 212 days ago
While collecting data according to policy is part of RL, 'reductive' is an understatement. It's like saying algebra is all about scalar products. Well yes, 1%