|
|
|
|
|
by fxtentacle
2193 days ago
|
|
They mean that if you add parameters, the learning capability of their approach grows by a similar amount as if you would add the same number of parameters to a conv+ReLu network (the standard approach). That "universal" is a weird claim in my opinion, but they mean that with enough parameters, this architecture can learn everything. |
|
Yes, the universal approximation is a strong claim. NN has been proven to have universal approximation theoretically.