by zeroCalories
522 days ago
What do you think of using the epsilon-first approach then? We could explore for that fixed time horizon, then start choosing greedily after that. I feel like the only downside is that adding new arms becomes more complicated.
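For concreteness, here is a rough sketch of what I mean by epsilon-first, assuming Bernoulli rewards and uniform random exploration. The function name `epsilon_first` and the parameters `arm_means`, `horizon`, and `explore_fraction` are made up for illustration, not taken from any library.

```python
import random


def epsilon_first(arm_means, horizon, explore_fraction, seed=0):
    """Simulate an epsilon-first bandit on Bernoulli arms (illustrative only).

    arm_means: hypothetical true success probability of each arm.
    horizon: total number of pulls.
    explore_fraction: share of the horizon spent on pure exploration.
    """
    rng = random.Random(seed)
    n_arms = len(arm_means)
    explore_steps = int(explore_fraction * horizon)
    counts = [0] * n_arms    # pulls per arm
    totals = [0.0] * n_arms  # summed reward per arm
    choices = []

    def estimate(arm):
        # Assumption: an arm with no data is scored 0.0, so the greedy
        # phase never tries it on its own.
        return totals[arm] / counts[arm] if counts[arm] else 0.0

    for t in range(horizon):
        if t < explore_steps:
            arm = rng.randrange(n_arms)            # explore: uniform random arm
        else:
            arm = max(range(n_arms), key=estimate)  # exploit: best estimate so far
        reward = 1.0 if rng.random() < arm_means[arm] else 0.0
        counts[arm] += 1
        totals[arm] += reward
        choices.append(arm)
    return choices, counts


choices, counts = epsilon_first([0.2, 0.5, 0.8], horizon=1000, explore_fraction=0.1)
print(counts)
```

The new-arm problem shows up in `estimate`: an arm added after the exploration window has no pulls, so under this sketch the greedy phase would never select it unless you reopen exploration for it.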