by WithinReason 90 days ago
These are residual layers, so what each layer computes is:

x = x + layer(x)

so it's not too surprising that they can be used recurrently: the output has the same shape as the input, and each pass just adds another update to x
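A rough sketch of that idea in NumPy, in case it helps. The toy `layer` here (a small random linear map followed by tanh) and the names `residual_step` and `apply_recurrently` are made up for illustration; they are not from any particular model.

```python
import numpy as np

rng = np.random.default_rng(0)
d = 8  # hypothetical hidden size

# Hypothetical toy layer weights; a real model would use a trained block here.
W = rng.normal(scale=0.1, size=(d, d))

def layer(x):
    # Stand-in for whatever the block computes (attention, MLP, ...).
    return np.tanh(x @ W)

def residual_step(x):
    # x = x + layer(x): the output has the same shape as the input.
    return x + layer(x)

def apply_recurrently(x, n_steps):
    # Reuse the same residual layer n_steps times.
    for _ in range(n_steps):
        x = residual_step(x)
    return x

x0 = rng.normal(size=(d,))
x3 = apply_recurrently(x0, 3)
```

Because each step maps a vector of size d to another vector of size d, the same layer can be fed its own output any number of times. Whether the result is useful after several passes is a separate question that this sketch does not address.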

1 comment

Ah! Thank you