Hacker News new | ask | show | jobs
by kevindamm 442 days ago
That is right, there is usually a parameter on the action selection function -- the exploitation vs exploration balance.