Hacker News new | ask | show | jobs
by flyers_research 1826 days ago
What are the size of the datasets? I have a hard time conceptualizing tabular business data large to be a problem.
2 comments

consider the problem of "online advertising"
When you have billions of rows the performance savings can be nice.
One of my projects several years back ran both a LR model and a DNN against the same input data (albeit featurized differently). Accuracy, P&R were roughly the same (minor differences depending on the time horizon), but the LR model took maybe a half hour to train and five minutes to run; the DNN took about 24 hours to train and an hour or two to run.

This wasn't even particularly huge data compared to my other projects. But certainly at that scale, there are huge differences between regression & NNs.