|
|
|
|
|
by easygenes
408 days ago
|
|
Mini batch and streaming GD make the benefits obvious and trivial. Closed form OLS is unbeatable so long as samples * params^2 is comfortably sitting in memory. You often lose that as soon as your p approaches 10^5, which is common these days. Soon as you need distributed, streaming, or your data is too tall and or too wide then first order methods are the point of call. |
|
What are the common cases for 10^5 parameter OLS? Perhaps something like weather models could include such computations?