Hacker News new | ask | show | jobs
by treprinum 1000 days ago
In theory you need two layers to model any function. In practice this is wildly different.
1 comments

Any memoryless continuous function between two Euclidean spaces, I think you mean. The experts-and-manager model would need to be able to do more than that (as do most neural networks).

And part of the reason why single-hidden-layer networks aren't enough even in continuous memoryless Euclidean cases is, again, because of how loss functions work; you're unlikely to converge on a good approximation with very few hidden layers.