Hacker News
by anon1253 2698 days ago
Love it. Especially the echo state network trick. I wonder how much of BERT/ELMo performance is simply due to them having such a high dimensionality. Not that there is anything wrong with that, it just makes them a tad less practical for some applications.