by WithinReason 783 days ago
Looks very interesting, but my guess is that this would run into exploding/vanishing gradients at larger depths, just as tanh or sigmoid networks do.
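
To make the tanh/sigmoid comparison concrete, here is a rough sketch of the effect in a toy setting. It is not the method under discussion. It assumes a chain of one-unit layers that all share a single weight, and the names `chain_gradient`, `weight` and `x` are made up for illustration. By the chain rule, the gradient of the output with respect to the input is the product of the per-layer terms `weight * act'(z)`.

```python
import math


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def chain_gradient(depth, weight, activation="sigmoid", x=0.5):
    """Return d(output)/d(input) for a chain h <- act(weight * h).

    Toy assumption: every layer is one unit wide and shares `weight`.
    """
    h, grad = x, 1.0
    for _ in range(depth):
        z = weight * h
        if activation == "sigmoid":
            h = sigmoid(z)
            local = h * (1.0 - h)  # sigmoid'(z), never larger than 0.25
        else:
            h = math.tanh(z)
            local = 1.0 - h * h    # tanh'(z), never larger than 1
        grad *= weight * local     # chain rule: multiply per-layer factors
    return grad


if __name__ == "__main__":
    for depth in (1, 5, 10, 20, 40):
        print(depth,
              chain_gradient(depth, 1.0, "sigmoid"),
              chain_gradient(depth, 1.0, "tanh"))
```

With `weight = 1.0` each sigmoid layer contributes a factor of at most 0.25, so the gradient is bounded by 0.25^depth and shrinks geometrically. The tanh chain shrinks too, just more slowly. The exploding case shows up when the per-layer factor exceeds 1: for instance `chain_gradient(10, 2.0, "tanh", x=0.0)` stays at the point where tanh' is 1, so the gradient doubles at every layer.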