Hacker News
by jimfleming 2635 days ago
Highway connections are actually used heavily in NLP in combination with LSTMs.
slashcom 2635 days ago
And replaced by residual connections in transformers, which are absolutely dominating LSTMs now.
stochastic_monk 2635 days ago
Transformer-XL uses recurrence, and most NLP SOTA is still with LSTMs. I’m not sure I’d expect attention mechanisms to fully replace recurrence.