|
|
|
|
|
by ced
5226 days ago
|
|
I was a fan of Gelman-style Bayesian modeling for a while, but it's a lot of work when you don't have a True Model. Have you tried machine learning techniques? You have a ton of predictors available for each customer --- gender, age, price of the first purchase, whether they bought before Christmas, etc. None of those fit neatly into standard models. Why not throw them all into one big linear regression for CLV, then move on to more sophisticated techniques? |
|
The problem with linear regression (or any machine learning technique) in CLV prediction is extrapolation. Since we are making a predictive measure, we are projecting out what customers will spend in the future.
You can't construct a valid training dataset to project a 2 year CLV if you business is not two years old. You need to make some assumptions about how customer's ordering will continue. We have found that assuming that customer behavior follows the latent attrition model is a robust assumption for most of our clients.
Even if the business has been around for a long time, we have found that customer behavior tends to change over time. In that early adopters are often far more valuable than more recent customers, so using the earlier adopters as the training set leads to misleading results.
To use a machine learning framework, you need to make the assumption that customers who join recently strongly resemble customers with similar attributes who joined in the past. We have found that this is often not a valid assumption, and so we make the simplifying assumptions about customer behavior to add power to our models.