Hacker News new | ask | show | jobs
by solomatov 1252 days ago
Naive RNN have vanishing gradient, but LSTMs and GRUs are much better in this respect.
1 comments

While this is true, and was a major advantage of LSTMs/GRUs, they still suffer from vanishing gradients.

w.r.t proteins, our sequences often surpass 1500 amino acids and that is really tough for an LSTM to stably train on.