by robertsdionne 617 days ago
These are real RNNs: they still depend on the prior hidden state; it’s just that the gating does not. The basic RNN equation can be parallelized with parallel prefix scan algorithms.
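
To make the scan point concrete, here is a rough sketch, assuming the recurrence is linear in the hidden state, i.e. h_t = a_t * h_{t-1} + b_t, where a_t and b_t are computed from the input alone. The names `a`, `b`, `h0`, `combine`, `sequential_rnn` and `scan_rnn` are mine, and scalars stand in for vectors to keep it short. Each step is an affine map, composing affine maps is associative, and that is what lets a prefix scan replace the loop.

```python
def sequential_rnn(a, b, h0=0.0):
    """Reference loop: h_t = a_t * h_{t-1} + b_t."""
    h, out = h0, []
    for a_t, b_t in zip(a, b):
        h = a_t * h + b_t
        out.append(h)
    return out


def combine(left, right):
    """Compose two affine maps h -> a*h + b; `left` is applied first."""
    a1, b1 = left
    a2, b2 = right
    return (a1 * a2, a2 * b1 + b2)


def scan_rnn(a, b, h0=0.0):
    """Hillis-Steele inclusive scan over (a_t, b_t) pairs.

    This simulates the scan on one thread. Within a pass, every position
    reads only the previous array, so the positions are independent and
    could be computed in parallel; there are about log2(n) passes.
    """
    elems = list(zip(a, b))
    n = len(elems)
    step = 1
    while step < n:
        # Build a fresh list from the old one; no in-place updates.
        elems = [
            combine(elems[i - step], elems[i]) if i >= step else elems[i]
            for i in range(n)
        ]
        step *= 2
    # elems[t] is now the composed map taking h0 directly to h_t.
    return [a_t * h0 + b_t for a_t, b_t in elems]


if __name__ == "__main__":
    a = [0.5, 0.25, 2.0, 1.0, 0.5]
    b = [1.0, 2.0, 3.0, 4.0, 5.0]
    print(sequential_rnn(a, b, h0=1.0))
    print(scan_rnn(a, b, h0=1.0))
```

Both calls should print the same sequence. If a_t depended on h_{t-1} (as with conventional gating), or a nonlinearity were applied to the hidden state, the steps would no longer be affine maps fixed in advance and this trick would not apply directly.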