Hacker News new | ask | show | jobs
by hlieberman 356 days ago
Wouldn't the correct tool here be a multi-armed bandit optimization, like an epsilon-greedy algorithm?