|
|
|
|
|
by YeGoblynQueenne
2709 days ago
|
|
>> All I see in this post is complaints and no real solutions. The solution that's given is what? Have less data? The solution is to direct research effort towards learning algorithms that generalise well from few examples. Don't expect the industry to lead this effort, though. The industry sees the reliance on large datasets as something to be exploited for a competitive advantage. >> You can reduce it via PCA one of the many techniques in multivariate statistic. PCA is a dimensionality reduction technique. It reduces the number of featuers required to learn. It doesn't do anything about the number of examples that are needed to guarantee good performance. The article is addressing the need for more examples, not more features. |
|
This is only true for the Facebooks and Googles of the world. There are definitely small companies (like the one I work for) trying very hard to figure out how to build models that use less data because we don't have access to those large datasets.
The industry is larger than just the Big N.