|
|
|
|
|
by VHRanger
3160 days ago
|
|
> There is so much focus on the machine learning algorithms rather than getting data ready for the algorithms. Generally, once a problem at work has come to the point of being a "kaggle problem", it's trivially easy. The main problem is unstructured data, with infinite ways of specifying similar ways to measure the same attribute, and lots of leeway to build an unmaintainable data pipeline between the data generation process and the model at the end. |
|
Others are far more complex and start with much messier data and/or complex formulations.
Examples:
- www.kaggle.com/c/nips-2017-non-targeted-adversarial-attack/ - www.kaggle.com/c/the-allen-ai-science-challenge