Hacker News new | ask | show | jobs
by cdrini 703 days ago
I disagree with the statement "they are trained to duplicate" because "to" implies a purpose/intent which is incorrect. I.e. "they are trained with the purpose of duplication". This is I believe pretty uncontroversially false. We already have methods to duplicate data. They are trained with the purpose of learning abstract patterns is much more correct. One of the biggest _problems_ of training is duplication, aka over-fitting. To say it's the purpose is imo disengenious.

Ah I see what they meant by that statement. It is true that supervised learning operates on labelled input/output pairs, and that neural networks generally use gradient descent/back propogation. (Disclaimer: it's been a few years since I've done any of this myself so don't quite remember it that well, and the field has changed a lot). Note since the parameter space of the neural network is usually _significantly_ smaller than the training data set, a network will not tend to minimise that cost function near 0 for an individual sample since doing so will worsen the overall result. There is inherent "fudging", although near identical output can potentially happen. The statement here is more reasonable and similar to the training process than the first.