Hacker News new | ask | show | jobs
by zodiac 3670 days ago
Did you use any regularization? Did you train until the training accuracy converged?

I feel like the network has enough capacity to overfit 180 training samples

1 comments

I experimented with a few regularization factors and ultimately settled on a lambda of 0.1. Rather than using a stopping criteria, I ran a fixed number of training iterations (~100) and just eyeballed the cost function results. Since my total training time was fairly brief (~2 minutes, tops), I had the luxury of designing the ANN somewhat heuristically.