
To "compute a neural network" is a long-established way to say "train a neural network", which in turn is a long-established way to say "find a set of weights for the neural network that maximises its accuracy".

The idea is that a neural net is a kind of data structure used in AI, like a decision tree or a decision list (like a decision tree, but it's a list). There are different algorithms that can "compute", i.e. construct, a decision tree from data. In modern parlance we say that the decision tree is "trained". The same goes for neural nets, except that the network itself is typically constructed beforehand, and manually (we refer to it as the "architecture" of the neural net), and only its weights need to be tweaked until it reaches good accuracy, at which point we say the training algorithm has "converged".
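To make "computing a decision tree from data" concrete, here is a minimal sketch of the smallest possible case: a one-split tree (a "stump") built by brute force over candidate thresholds. The names `build_stump` and `predict` and the toy data are made up for illustration; real tree learners are considerably more elaborate.

```python
def build_stump(xs, ys):
    """Construct a one-split decision tree from 1-D data by trying every
    candidate threshold and keeping the one with the fewest errors."""
    best = None
    for threshold in sorted(set(xs)):
        for left_label in (0, 1):
            right_label = 1 - left_label
            errors = sum(
                (left_label if x < threshold else right_label) != y
                for x, y in zip(xs, ys)
            )
            if best is None or errors < best["errors"]:
                best = {
                    "threshold": threshold,
                    "left": left_label,
                    "right": right_label,
                    "errors": errors,
                }
    return best


def predict(stump, x):
    """Follow the single split to a leaf label."""
    return stump["left"] if x < stump["threshold"] else stump["right"]


# Hypothetical toy data: small values are class 0, large values are class 1.
xs = [1.0, 2.0, 3.0, 6.0, 7.0, 8.0]
ys = [0, 0, 0, 1, 1, 1]

stump = build_stump(xs, ys)
```

Note that here the structure itself (where the split goes) comes out of the data, whereas for a neural net the structure is fixed up front and only the weights are found.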

It's all a bit confusing because in common parlance there is little distinction made between a neural net's network (its architecture), the algorithm that trains the neural net by finding the weights that minimise its error (backpropagation, usually paired with gradient descent), and the neural net with trained weights (the "model"). Sometimes I wonder if this distinction is clear in the minds of people who actually train those things.
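The three things can be kept apart in code. This is only a sketch under simplifying assumptions: the "network" is a single sigmoid unit, so backpropagation collapses to one chain-rule step, and the function names, learning rate and epoch count are arbitrary choices of mine.

```python
import math

# 1. The architecture: fixed by hand before training. It only says how
#    inputs and weights are combined, not what the weights are.
def forward(weights, x):
    w1, w2, b = weights
    z = w1 * x[0] + w2 * x[1] + b
    return 1.0 / (1.0 + math.exp(-z))


# 2. The training algorithm: gradient descent that searches for weights
#    minimising the error of the architecture on the data.
def train(data, lr=0.5, epochs=5000):
    weights = [0.0, 0.0, 0.0]
    for _ in range(epochs):
        for x, y in data:
            p = forward(weights, x)
            grad = p - y  # gradient of the cross-entropy loss w.r.t. z
            weights[0] -= lr * grad * x[0]
            weights[1] -= lr * grad * x[1]
            weights[2] -= lr * grad
    return weights


# 3. The model: the architecture together with the trained weights.
AND_DATA = [((0, 0), 0), ((0, 1), 0), ((1, 0), 0), ((1, 1), 1)]
model = train(AND_DATA)
predictions = [round(forward(model, x)) for x, _ in AND_DATA]
```

`forward` never changes, `train` can be swapped for another optimiser, and `model` is just a list of three numbers that only means something when plugged back into `forward`.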

Btw, the study is solid and meaningful. It's a theoretical result. More of those are needed in machine learning; we've got plenty of empirical results.