|
|
|
|
|
by sarpdag
523 days ago
|
|
I really like multi armed bandit approach, but struggles with common scenarios involving delayed rewards or multiple success criteria, such as testing ecommerce search with number of orders and GMV guardrails. For simple, immediate-feedback cases like button clicks, the specific implementation becomes less critical. |
|