|
|
|
|
|
by igorkraw
2727 days ago
|
|
If i remember my Goodfellow correctly (and quickly checking, wikipedia, I did https://en.wikipedia.org/wiki/Universal_approximation_theore... ), there is a nuance here which is almost always missed: you can represent any function with a sufficiently wide 2 layer neural network, it doesn't say anything about being able tune the network until you find a correct setting (i.e. learnability). This is important. Flippantly said,discarding learnability and speed of convergence, you can get the power of any neural network by the following algorithm: 1. Randomly generates a sufficiently wide bit pattern
2. Interprets it as a program and run it on the test set
3. discard results until the desired accuracy is reached |
|