by utopkara 2722 days ago
With a high-capacity model such as a DNN, you usually do not even want the global optimum of the training loss anyway, because reaching it may well mean that you are overfitting to your training dataset. (SVMs, by contrast, memorize only a small number of data points, the support vectors.)
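A toy sketch of the overfitting point, using polynomials instead of a DNN (my substitution, not something from the thread): with 10 noisy samples, a degree-9 polynomial can hit the global optimum of the training loss, which is essentially zero error, while a degree-3 fit cannot. The ground-truth function, noise level, and degrees here are all made-up assumptions, and `fit_and_score` is a hypothetical helper. The high-capacity fit typically does worse on held-out points, though the exact numbers depend on the seed.

```python
import numpy as np
from numpy.polynomial import Polynomial


def mse(model, x, y):
    """Mean squared error of a fitted polynomial on (x, y)."""
    return float(np.mean((model(x) - y) ** 2))


def fit_and_score(degree, x_train, y_train, x_test, y_test):
    """Least-squares polynomial fit; returns (train MSE, test MSE)."""
    model = Polynomial.fit(x_train, y_train, deg=degree)
    return mse(model, x_train, y_train), mse(model, x_test, y_test)


# Assumed toy setup: noisy samples of sin(x) on [-3, 3].
rng = np.random.default_rng(0)
x_train = np.linspace(-3.0, 3.0, 10)
y_train = np.sin(x_train) + rng.normal(scale=0.3, size=x_train.shape)
x_test = np.linspace(-3.0, 3.0, 200)
y_test = np.sin(x_test)  # noise-free targets for evaluation

# Degree 3: limited capacity, cannot pass through every noisy point.
low_train, low_test = fit_and_score(3, x_train, y_train, x_test, y_test)
# Degree 9: 10 coefficients for 10 points, so it interpolates the noise.
high_train, high_test = fit_and_score(9, x_train, y_train, x_test, y_test)

print(f"degree 3: train MSE {low_train:.4f}, test MSE {low_test:.4f}")
print(f"degree 9: train MSE {high_train:.2e}, test MSE {high_test:.4f}")
```

The degree-9 model is the one sitting at the global optimum on the training set; the comparison of the two test errors is what shows whether that optimum was worth reaching.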