Hacker News new | ask | show | jobs
by OkayPhysicist 627 days ago
The most popular RNNs (the ones that were successful enough for Google translate and the like) actually had this behavior baked in to the architecture, called "LSTMs", "Long-Short Term Memory"