by mitbal
3477 days ago
That's a very interesting case. At my company, we would also like to optimize our email marketing campaigns using RL. However, based on my limited experience with RL (please correct me if I'm wrong), wouldn't it take a long time to iterate and update the value function V and the policy (or the Q function, if we use Q-learning)? I'm a bit skeptical that it can be used in a real-world case where we need to wait days for the email response to arrive as feedback from the environment.