Hacker News new | ask | show | jobs
by AlexCoventry 165 days ago
What's the advantage of having multiple channels with separate residual connections? Why not just concatenate those channels, and do residual connections on the concatenated channel?