by cuchoi 2501 days ago
Very exciting to see causal theory being productionized!

From the article, this seems like a normal regression to me. It would be interesting to know what makes it causal (or at least better) compared to OLS. PCA has been used for a long time to select the features to use in regression. Would it be accurate to say that the innovation is in how the regression is calculated rather than in the statistical methodology?

Either way, it would be interesting to test this approach against an A/B test and check how much the observational estimates differ from the A/B estimates, and how sensitive this approach is to including (or not) a given set of features. It would also be interesting to compare it to other quasi-experimental methodologies, such as propensity score matching.

Is there a more extended document explaining the approach?

Good luck!

1 comments

Yes, you're correct that the underlying algorithm used is very close to OLS. What allows the regression to provide an estimate of average treatment effects is how it is structured: it includes pre-treatment confounders as well as interactions between the treatment and those confounders. I found that this chapter (http://www.stat.columbia.edu/~gelman/arm/chap9.pdf) on causal inference does a good job of outlining the approach.
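
To make the structure described above concrete, here is a minimal sketch of a regression on treatment, a centered pre-treatment confounder, and their interaction. It is not the implementation from the article: the simulated data, the coefficient values, and the helper names (`solve_linear`, `ols`, `ate_interacted_ols`) are all assumptions made for illustration, and it uses only the standard library so it runs anywhere. Because the confounder is centered, the coefficient on the treatment can be read as the average treatment effect, provided there are no unobserved confounders.

```python
import math
import random

def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x

def ols(rows, y):
    """Least-squares coefficients via the normal equations (X'X) b = X'y."""
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    return solve_linear(xtx, xty)

def ate_interacted_ols(y, t, x):
    """Coefficient on t in y ~ 1 + t + (x - mean) + t * (x - mean)."""
    x_bar = sum(x) / len(x)
    rows = [[1.0, ti, xi - x_bar, ti * (xi - x_bar)] for ti, xi in zip(t, x)]
    return ols(rows, y)[1]

# Hypothetical simulated data: x confounds both treatment and outcome.
random.seed(0)
n = 5000
x = [random.gauss(0, 1) for _ in range(n)]
t = [1.0 if random.random() < 1 / (1 + math.exp(-xi)) else 0.0 for xi in x]
# By construction the true average treatment effect is 2.0 (E[x] = 0).
y = [1 + 2 * ti + 3 * xi + 1.5 * ti * xi + random.gauss(0, 1)
     for ti, xi in zip(t, x)]

treated = [yi for yi, ti in zip(y, t) if ti == 1.0]
control = [yi for yi, ti in zip(y, t) if ti == 0.0]
naive = sum(treated) / len(treated) - sum(control) / len(control)
adjusted = ate_interacted_ols(y, t, x)
print("naive difference in means:", round(naive, 2))
print("interacted OLS estimate:  ", round(adjusted, 2))
```

In this toy setup the naive difference in means is inflated by the confounder, while the adjusted coefficient lands near the simulated effect. In practice one would likely use a library such as statsmodels rather than hand-rolled normal equations.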

Yes, we actually explored other approaches such as PSM. The main reason we did not initially go with PSM was the compute power required: you would need to train a model for each treatment variable. However, we're in the midst of developing a way to train a model for each treatment variable efficiently, which will allow us to add methods such as inverse propensity weighting (or explore other approaches such as PSM).
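
For readers unfamiliar with inverse propensity weighting, a rough sketch follows. It shows why a separate model per treatment variable is needed: the propensity model predicts that one treatment from the confounders. Everything here is an assumption for illustration (the simulated data, the single confounder, the Newton-step logistic fit in `fit_logistic`, and the normalized weighting in `ipw_ate`); it is not the approach the product uses.

```python
import math
import random

def sigmoid(z):
    return 1 / (1 + math.exp(-z))

def fit_logistic(x, t, iters=25):
    """Fit P(t=1 | x) = sigmoid(a + b*x) with Newton's method."""
    a, b = 0.0, 0.0
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for xi, ti in zip(x, t):
            p = sigmoid(a + b * xi)
            r, w = ti - p, p * (1 - p)
            g0 += r
            g1 += r * xi
            h00 += w
            h01 += w * xi
            h11 += w * xi * xi
        det = h00 * h11 - h01 * h01
        # Newton step: (a, b) += H^{-1} g for the 2x2 information matrix H.
        a += (h11 * g0 - h01 * g1) / det
        b += (h00 * g1 - h01 * g0) / det
    return a, b

def ipw_ate(y, t, x):
    """Normalized (Hajek) inverse-propensity-weighted ATE estimate."""
    a, b = fit_logistic(x, t)
    num1 = den1 = num0 = den0 = 0.0
    for yi, ti, xi in zip(y, t, x):
        # Clip propensities away from 0 and 1 to keep weights finite.
        p = min(max(sigmoid(a + b * xi), 1e-6), 1 - 1e-6)
        num1 += ti * yi / p
        den1 += ti / p
        num0 += (1 - ti) * yi / (1 - p)
        den0 += (1 - ti) / (1 - p)
    return num1 / den1 - num0 / den0

# Hypothetical simulated data with one observed confounder x.
random.seed(1)
n = 5000
x = [random.gauss(0, 1) for _ in range(n)]
t = [1.0 if random.random() < sigmoid(xi) else 0.0 for xi in x]
# By construction the true average treatment effect is 2.0.
y = [1 + 2 * ti + xi + random.gauss(0, 1) for ti, xi in zip(t, x)]

treated = [yi for yi, ti in zip(y, t) if ti == 1.0]
control = [yi for yi, ti in zip(y, t) if ti == 0.0]
naive = sum(treated) / len(treated) - sum(control) / len(control)
ipw = ipw_ate(y, t, x)
print("naive difference in means:", round(naive, 2))
print("IPW estimate:             ", round(ipw, 2))
```

With many treatment variables, the `fit_logistic` step would have to be repeated once per treatment, which is the compute cost mentioned above.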

This approach only works if all confounders are known, which is never the case in practice, so the model you fit is correlational and not suitable for causal inference. Propensity matching suffers from the same issue if the propensities are estimated from the same features. If not all confounders are known, you must be able to find instrumental variables to build a causal model.
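
A small simulation can illustrate the point about unobserved confounders and instrumental variables. This is only a sketch under assumed conditions: the data are simulated, the treatment is continuous, and `z` is a hypothetical instrument that affects the outcome only through the treatment. With a single instrument, two-stage least squares reduces to the ratio of covariances used in `iv_slope`.

```python
import random

def cov(a, b):
    """Sample covariance of two equal-length sequences."""
    ma, mb = sum(a) / len(a), sum(b) / len(b)
    return sum((ai - ma) * (bi - mb) for ai, bi in zip(a, b)) / (len(a) - 1)

def ols_slope(t, y):
    """Slope from regressing y on t; biased if a confounder is omitted."""
    return cov(t, y) / cov(t, t)

def iv_slope(z, t, y):
    """Single-instrument IV (Wald) estimator: cov(z, y) / cov(z, t)."""
    return cov(z, y) / cov(z, t)

# Hypothetical simulated data: u is a confounder we pretend not to observe.
random.seed(2)
n = 5000
u = [random.gauss(0, 1) for _ in range(n)]
z = [random.gauss(0, 1) for _ in range(n)]  # instrument, independent of u
t = [0.8 * zi + ui + random.gauss(0, 1) for zi, ui in zip(z, u)]
# By construction the true effect of t on y is 2.0.
y = [2 * ti + 3 * ui + random.gauss(0, 1) for ti, ui in zip(t, u)]

ols_est = ols_slope(t, y)
iv_est = iv_slope(z, t, y)
print("OLS slope (u omitted):", round(ols_est, 2))
print("IV slope using z:     ", round(iv_est, 2))
```

The regression that omits `u` overstates the effect, while the IV estimate recovers something close to the simulated value. That only holds because `z` was constructed to be a valid instrument, which is hard to guarantee with real data.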

Thanks for answering!