Hacker News new | ask | show | jobs
by sdenton4 3005 days ago
afaik, pure DNN models still lag seriously behind 'traditional' HMM-based frameworks augmented by neural networks (using DNNs for specific parts of the pipeline). Last I checked a couple months ago, state of the art for HNN+DNN was something like 6% word error rate (WER). The best Seq2Seq DNN I know of hit 18% WER, dropping to 10% when a secondary language model was integrated in. (my guess is that part of the problem is leaning too heavily on the attention mechanism... a more 'streaming friendly' framework should help reduce the load on the attention mechanism.)

https://arxiv.org/pdf/1610.03022.pdf

1 comments

This has changed recently, full seq2seq is now matching hybrid models [0].

[0] https://arxiv.org/abs/1712.01769

Oh, thanks! Now I know what I'm reading on the commute tomorrow.