Hacker News new | ask | show | jobs
by seventytwo 2143 days ago
Not sure I understand the importance of this...
1 comments

Indeed, CA can be represented by simple combinations of boolean functions, obviously by NN also, which is a combination of similar nonlinear functions.
A wide enough NN can represent any arbitrary binary function, but it's not obvious that one can learn it.
Yet the best NNs are deep, not wide.
What do you mean by 'the best'? Deeper architectures are popular because they quiet easy to train. They do work well in practice on many tasks (especially vision) but they have their limits.

Infinite wide networks are a newly active field and has recently shown some promising results, theoretically [1, 2] and empirically [3].

[1] https://arxiv.org/abs/2001.06931 [3] https://arxiv.org/abs/1806.07572 [2] https://ai.googleblog.com/2020/03/fast-and-easy-infinitely-w...